FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGT C TGAAGAC T AACAT T T TGT GAAGT TGTAAAACAGAAAACC TGTT AGAAATGTGG T GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTA^TTTGGACATCTGCTGCTTTCATATTT 
T CATACAT TACT GCAGTAACACT CCACCATATAGAC CCGGCT T TACCT T AT ATCAGT GACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATT TATGTT CGTTATAAGCAAGTTCATGCTCTGAGTCC TGAAGAGAACGTTATCATCAAA 
|?k T TAAACAAGGC T GGCCT T G TACT TGGAATACTGAGT TGT T TAGGACT T T C TATTGTGGCAAACT T 

l?l CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
J£ TATATATGTT TGTT CAGACCATC CT T T CC TACCAAAT GCAGCCCAAAAT CCAT GGCAAACAAGT C 

i E 5 



TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTT T TGCACAGT GGCAAT T T T GGGAC TGAT T TAGAACAGAAACTCCAT TGGAACCCCGAGG 
III ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
*W GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 

T GGAT TAACCC T CTAT GACAC T GCACC T TGCCCTAT TAACAATGAACGAACACGGCTACT T TCCA 
GAGATAT T TGA T GAAAGGATAAAATAT T TC TGTAAT GAT T ATGAT T CTCAGGGAT TGGGGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
Iff GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATT T GATAGAT TATTCTAAAGGATATCAT 

P CAAGAAGACTATTAAAAACACCTAT GCCTATACTTTT TTATCTCAGAAAATAAAGTCAAAAGACT 

f $ I 

Ft» ATG 
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FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAA.FI FSYI TAVTLHH I DPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEENVI IKLNKAGLVLGILSCLGLS IVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMI TTAAEWSMS FS FFG FFLT YI RDFQKI SLRVE ANLHGLTLYDTAPCP INNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGC^GACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTT CGCCATGAGTT TCCT CATCGACT CCAGCAT CATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
CAT AAGC AAAAAGAAAAGGAT GGCAAT GG CAC G GAGAACAAT GTT CC AGAAGG GG GAAGT G CAT AACA 
AACCATCAGGT TT CTGGGGAATGATAAAAAGT GTTAC CACTT CAGCATCAGGAAGTGAAAATCT TACT 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATAT GCT ACCAAGGAGAGAATAGAATACT CCAAAACCTTCAAGGGGAAATAT TTTAATT T TCTT GGT T 
Ul ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
M GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 



III 

M 

Hi 



w 



GAAGTTT TGGT CCCAACACATTTCCTT CATTCT TGT T GGAATAATCATCGT CACATCCAT CAGAGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAAT GGCAC CTTGAACT TAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAA.CAAAATGCTATGGTAGC 
ATT T TTCACCT TCATAGCAT ACT CCTTCC CCGT CAGGTGATACTAT GAC CAT GAGTAGCAT CAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCAT CC CGT GTGGATAT GAGGCTGG 
T GT AGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACAC T GGAAC T CTGGGGCAAGACATGT 
C TAT GGT AGCT GAGCCAAACACGTAGGAT TTC CGTTT TAAGGTTCACATGGAAAAGGTTATAGC TTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACT CTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 4 



MS FLIDSS IMI TSQILFFGFGWLFFMRQLFKDYEIRQYWQVI FSVTFAFSCTMFELI IFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDWFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSWIVLLLAQIMGMY 

fr* FVSSVLL IRMSMPLE YRT 1 1 TEVLGELQFNFYHRWFDVI FLVSALSS I L FL YLAHKQAPEKQMAP 

M 

n 

Important features: 
i 8 l Signal peptide: 

Iff amino acids 1-23 

fi\ 

Potential transmembrane domains: 

^ amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 

?S 425-444 

hi ■ 

s 

ill N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

ru 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAAT C CGGATGT CT CGGT TA-TGAAG T GGAGCAGTGAGTGT GAGC CTCAACATAG T TCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCAT GCAGAAT TAT GGGGATCACC CT TGTGAGCAAAAAGGCGAACCAGCAGCT GAAT T T CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
H ATT ATCACCACCAAAGATCCCAT AT TCAACAC T CAAACTGCAACACAAACAACAGAAT T TAT TGT 

Q CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
J*i CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
J}1 GAAAC TAGCACCAT GT CTACAGAAACT GAACCAT TT GT TGAAAATAAAGCAGCAT TCAAGAATGA 

f I AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCT.TCTTTGGTGCTG 
ill CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
111 CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
[,I T GAGGAAT CAAAGAAAAC TGATAAAAACCCAGAAGAGT CCAAGAGT C CAAGCAAAACTACCGTGC 

GAT GCC TGGAAGCT GAAGTT TAGAT GAGACAGAAAT GAGGAGACACACCTGAGGCTGG T T T CT T T 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
* CCC T T GGT T CCTAACT GGAATCAGCTCAGGAC TGCCAT TGGAC TATGGAG T GCACCAAAGAGAAT 

Q GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
Iff TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 

CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGT T TAGC CCCTGAAAGCCATGGAGATT C T CATAACT TGAGACCTAATC TCTG T AAA 
GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
ACAGACAGGGT CAAAGTGTT TT CTCTGAACACAT TGAGT T GGAATCACT GT T TAGAACACACACA 
CT T ACT T T T TCT GGTC TC TACCACT GCTGATAT T TT C T C T AGGAAATATAC T T T TACAAGTAACA 
AAAATAAAAACT CT TATAAATT T C TATT TT TATC TGAGTT ACAGAAATGAT TAC TAAGGAAGAT T 
AC T CAGTAATT T GT TT AAAAAGTAATAAAAT T CAACAAACAT TTGCT GAAT AGC TACTATAT GTC 
AAGTGCTGTGCAAGGTATTACACTCTGTAATTGAATATTATTCCTCAAAAAATTGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
T T T TAT T T T T GC T GAG AC T AAT C T TAT T CA T T T T C T C T AAT AT GG C AAC CAT T AT AAC C T T AAT T 
TATTATTAACATACCTAAGAAGTACATTGTTACCTCTATATACCAAAGCACATTTTAAAAGTGCC 
AT TAACAAATGTATCACTAGCCCTCCTTTT TCCAACAAGAAGGGACT GAGAGATGCAGAAATATT 
TGT G AC AAAAAAT T AAAGC AT T T AG AAAAC T T 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTSAKEACRLLG 
LS LAGKDQVE TALKAS FE TCS YGWVGDG FWT S RI S PNPKC GKNGVGVL I WKVPVSRQFAAYC YN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT TGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCT GCTA 
AAGCATCATCAGAAG T GAACC T GGC AAAC T TACC TC CCAGCTAT CACAATGAGACCAACACAGAC 
ACGAAGGT TGGAAATAATACCAT CCAT GT GCACC GAGAAAT TCACAAGATAACCAACAACCAGAC 
T GGACAAATGGT CT T TTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACGAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
III ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
111 CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
t|| CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
gg GAGATCCTGCTGCC CAGAGAGGT CC CCGATGAGTAT GAAGTTGGCAGC T T CAT GGAGGAGGT GCG 

CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGC T GGGAGGGGAAGAGATT TAGATCTGGACCAGGC T G T GGGTAGATGT GCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
ill GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
Q TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
| ; || TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCGTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATT GACAC TGTCCC TCTT TGGCAGT TGCAT TAGTAACT T T GAAAGGTATAT GAC TGAGCGTAGCA 
TAC AGGT TAACC TGCAGAAACAGTAC T TAGGTAAT T GT AGGGCGAGGATTATAAATGAAAT T TGC 
AAAATCAC T TAGCAGCAACTGAAGACAAT TATCAACCACGTGGAGAAAAT CAAACCGAGCAGGGC 
TGTGTGAAACAT GGT T GTAATAT GC GAC T GCGAACAC T GAAC TC TACGCCAC T CCACAAATGAT G 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GAT TGTATAAGCATGC TT TC T T TGAGT TT TAAAT TAT G TATAAACATAAGT TGCATT TAGAA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 



CIS 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RS AVEEMEAEEAAAKAS SEVNLANLPPS YHNE TNTDTKVGNNT IHVHRE IHKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site, 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site, 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 



CGGA CGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAG T GC TGCCT CT GGTGTCTGGAAAAAT T TAT CAAG TTCC TAAACCGCAAT GCATACATCATGA 



D TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 



ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 



C.| ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 



GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTGCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 

fl* AGAT TC T GGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC CGG 

lit 

CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 

F|i CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 

Q TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 

t\l AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 

GCTACTCGGGAGGGTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 

GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 

AAGAT TT TAT TAAAGATAT TT TGT T AACTC 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFASFYWAFHKPQDI PTFPL I SAFIRTLRYHTGS LAFGAL I LTLVQ I ARVI L 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRWVLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites . 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 

III ccaaggtccaggacgcccagcccaactcgggtc^ 

TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCC T GAT GCAGACCGAGGAGTGCCCACCTATGC TAGACGCCAC ACAGCAGCAGCAGCAGCAGGT 
rjj GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
P ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
III ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

CCTCTACCTGTGGACCCTGGTAGCGCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 



M 
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* 

ill 
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FIGURE 12 



£3 

||I Signal sequence 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLS IAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNSGLLQASVI TLYTMFVTWSALSS I PEQKCNPHLPTQLGNETWAGPEG YETQWWDAPS I 
VGLIIFLLCTLFISLRSSDHRQWSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVLASLHVMMTLTISW^^ 
FS 



amino acids 1-20 



Transmembrane domains: 

g amino acids 40-58, 101-116, 134-150, 162-173, 206-223, 240-257, 

C! 272-283, 324-340, 391-406, 428-444 

ill 

CI 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGG ATGA ACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAAT GGAGGCAT T GAGAACACAT TAGAGAAGGAGGT GAT GCAGTAT GAC T AC TATTCT T CATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACT GCGCCAT T GGT GGGCAATAGCGT T GACAACGGC AGTGACCAG T GCCT T T T TACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGAC TCC TGAT AGT T CAGGAT GCT TCAGAGAGGGCAGCAC TTATACCTGGT GGTC T T TCT GAT G 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACT T TT AGAAC TATGAGTACTACT T T TGT TAAATGTGAAAAAC CCTCACAGAAAGT C 
ATC GAGGCAAAAAGAGGCAGGCAGT GGAGTCTCCCT G TCGACAGTAAAGT T GAAATGG TGACGT C 
CACTGCTGGCT T TAT T GAACAGC TAATAAAGAT T TAT TTAT TG TAATACCT CACAAAC GTT GTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
T CAGT GAGAC TGAGCCTGATGTGT TAACAAATAGGT GAAGAAAGTC T TGTGC TGTAT TCCTAATC 
AAAAGACT TAATATATTGAAGTAACACT TT TT TAGTAAGCAAGATACCT T T T TAT TTCAATT CAC 
AGAAT GGAAT T TT T T T GT T TCAT G T C T CAGAT T TAT T TTGTAT T TCT TTT T TAACACTCT ACAT T 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CT T GCAAAAGGGGAAGAAAGGAAT T GC GAATACATGTAAAATG TCACCAGACAT T TGTATTATT T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGAC T T AAAC CAT TCATATCATGT TTCC TT T GCGT TCAGCCAAT T TCAAT TAAAAT GAA 
CTAAAT TAAAAA 
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FIGURE id 



MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNVNGGIENTLEKEVMQYDYYSSYF 

AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETOFLDFKVLPQEM1EENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 




N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 



Transmembrane domains: 



amino acids 54-72, 100-118, 130-144, 146-166 




HI 
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FIGURE l* 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGGCTGCTGCl'GCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
G GCC TGTAC TGT GAGAGCCAGAT GGGGCAGGGGACACGGC CCAGC CC TACACCAGTCACGCCGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCT GTGTGAC TC TAGTCT TGGC CC CAGGAAGCGAAGGAACAAAAGAAACT GGAAAGGAAGAT GC 
T T TAGGAACAT GTTTTGCTTTTT TAAAATATATAT AT T TATAAGAGATCCT T TCCCAT T T AT TCT 
GGGAAGATGT T T T T CAAAC TCAGAGACAAGGACT T T GGTT T TTGTAAGACAAACGATGAT AT GAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGIJU^QELDVSNLSLQALPGDLSGLFPRLRLiU^^^ 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
ft VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
f3 T VT QLR PNAT Y S VC VMP L G PGRVPE GEE AC GEAH T P PAVH S NHAP VT QARE GNL PL L I APALAAV 

LLAALAAVGAAYCVRRGRMAAAAQDKGQVGPGAGPL^ 
CE VPLMGFPGPGLQS PLHAKPY I 



m 
w 

»! 



Important features: 



S Signal peptide: 

en 

*** amino acids 1-23 

III 

K Transmembrane domain: 

Ri amino acids 579-599 

HI 

P! EGF-like domain cysteine pattern signature, 

fll amino acids 430-442 

Leucine zipper pattern. 

amino acids 197-219, 269-291 

N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCTTAGAT TCCAAGACTAC T T TGACAT CAGATGAG T CAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATAT TTCTTGAT TCAGAAGAATCTGA 
ATTAGAATCCTCTATT CAAGAAGAGGAAGACAGCCT CAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGATATCAGCT TTCTAGAGTCT CCAAATCCAGAAAACAAGGACTAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCT TT GACCGCCAT TGAAGGCACAGCACATGGGGAGCCCTGCCAC TTCCCTTT TCT 
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 

t 5 :| 

fj| C TACAACC TATGACTACAAAGCAGATGAAAAGTGGGGCT T TTG TGAAACT GAAGAAGAGGC T GCT 

If! AAGAGACGGCAGAT GC AGGAAGCAGAAATGAT GT AT CAAAC TGGAAT GAAAATC CT TAATGGAAG 

W CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 

[ I s 

f I CCAAAGCCC TGGAGAGAGTGTCATATGCTC TT T TAT T T GGTGAT TACT TGCCACAGAATATCCAG 

III 

f*% G CAGCGAGAGAGAT G T T T GAGAAGC T GAC T GAGGAAG GCT C TCCCAAGGGACAGACTGCTCTTGG 



CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT TAG TGGAAGGCT 
AATAATAT TAACAT CAGAAGAAT TT G T GGTT TATAGCGGCCACAACT T TT T CAGCT T TCATGAT C 
CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
AACACATGGAAT CTACATGTAAATGAAAGTTGGTGGAGTCCACAATTT TTCTTTAAAATGAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAATTGTCAGAATCAT T T T T TACAT TAGAT TATCATAAT T T TAAAAAT T T T TCT TTAG T TT T TCA 
AAATT TTGTAAATGG TGGC TATAGAAAAACAACATGAAATAT TATACAATAT T T TGCAACAATGC 
C CT AAGAAT TG T TAAAAT TCATGGAG T TAT T TGTGCAGAATGAC TCCAGAGAGC TC TACTT T CTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAAC TCAT T T T TAATAAAATTATGTCTAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSIASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
E SSI QEEEDSLKS QEGE SVTEDI S FLE S PNPENKDYEEPKKVRKPALTAIEGTAHGE PCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALVY YT FGALGGNL I AHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 



Tyrosine kinase phosphorylation site. 

amino acids 220-228 



h.J N-glycosylation sites. 

Ill amino acids 195-199, 217-221, 272-276 

III 

til 

111 

Q N-myristoylation sites. 

|l! amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

0 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal . 

amino acids 299-303 



Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 10 

AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTAT TATACAAGAAAGGAG TG TACC TATCACACACAGGGGGAAAAATGCTCT TT T GGG TGCT AGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATT T T T ATCAC TGGATGTGACTCGGGCT T TGGAAACTT GGCAGCCAGAAC TT T T GAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACCTCAGAGAGACT TCGTACTGTGCTTCTGGATGT GACCGACCCAGAGAATGTCAAGAGGACT G 
* . CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
g| CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
P CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
Mj TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 

rjl TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 

I? I 

ATGCAT TGAACCAGGAT T GT TCAAAACAAACT TGGCAGAT CCAGTAAAGG T AAT TGAAAAAAAAC 
;f| TCGCCAT T TGGGAGCAGCT GTCT CCAGACATCAAACAACAATATGGAGAAGGT TACAT TGAAAAA 

AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
! *J CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
m GCAGAGC TGGC TAAT CCCAAGGCAGT GTGA CTCAGC TAACCACAAAT GTCT CCTCCAGGCTATGA 

Q AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
fit CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCrTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTC TTAT CT AAATAT TAAAAGATAAGTCAAC CCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 



li! 



App_ID=10063538 



Page 171 of 320 



FIGURE 20 

MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPE1WKRTAQWVKMQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
RE P I EVNL FGL I S VT LNMLPLVKKAQGRV INVS S VGGRLAI VGGGYT PSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKI^ 

LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 



j Important features of the protein: 

I** Signal peptide: 

M 

f*| amino acids 1-17 

f 

I „ s 

^ Transmembrane domain: 

m 

amino acids 136-152 

Hi 

g N-glycosylation sites. 

0 amino acids 161-163, 187-190 and 253-256 

If! 

|f s Glycosaminoglycan attachment site. 

1 II 

F-s amino acids 39-42 

III 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCG6GCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGT GAAGCCAAGAACAGCAT TACTGATTCCCAAATGGATGATG T TGAAGT T GTT TATACAAT TGA 
CAT TGAGAAATATATT CCATGC TAT CAGCT TT T TAGCTTT TATAAT T CTT CAGGCGAAG TAAAT G 
AGC AAGCACTGAAGAAAATAT TATCAAATG TCAAAAAGAATGT GG TAGGT T GGTACAAAT TC CGT 
CGT CATTCAGATCAGATGATGACGTT TAGAGAGAGGCTGCT TCACAAAAACTTGCAGGAGCATT T 
T TCAAACCAAGACCTTGTT TT TCTGCTATTAACACCAAGTATAATAACAGAAAGCTGC TCTACTC 
ATCGAC TGGAACAT TC C T TATATAAACCTCAAAAAGGAC T T TT TCACAGGGTACCT T TAGTGGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T TT TAGCC GAGCAGTACAAACAC ACAGCTC TAAATT T T T T GAAGAAGAT GGATCC TTAAAGGAGG 
T ACATAAGATAAAT GAAATGTAT GCTTCAT TACAAGAGGAATTAAAGAGTATATGCAAAAAAGT G 
GAAGACAGTGAACAAGCAG TAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAAT TGA 
GAAAAGGAGAGGAGCACAGAT TCAGGC AGCAAGAGAGAAGAACATCCAAAAAGACCC T CAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAATC TGACCT TAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGC TAGTACACCAC 
AAATCATTAAGCATAAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTA 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGAT GAAGAAAT TGAAAAGAT GAAGGGTT T TGG TGAATAT TCACGGT CT C 
CTACATTT TGAT CCT TTTAACCT TACAAGGAGATTTT TTTATTTGGCTGATGGGTAAAGCCAAAC 
AT TT CTAT TGT T T T TACTATGT T GAGCTAC T TGCAGTAAGT TCAT T T G TT T T TACTATG TT CACC 
TGTT TGCAGTAATACACAGATAACT CTTAGTGCATTTACTTCACAAAGTACT TTTTCAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGT CAGGAGGAT TGAT TGAAGCTAGGAGT TAGAGACCAGCCTGGGCAACGTAT T 
GAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTATTTTCAAAATATGGAAA 
GAAATTTATATGAAAAT T TATC T GAG T CAT TAAAAT T CTC C T TAAGT GATAC TT T T T TAGAAGTA 
CATTATGGCTAGAGT TGCCAGATAAAATGCTGGATATCATGCAATAAATT TGCAAAACATCATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEViraQALKKILSNVKKN^ 

VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QI QAAREKNI QKDPQENI FLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE I EKMKGFGE YSRS PTF 




amino acids 1-19 



Signal peptide: 



Important features: 




N-glycosylation sites. 



s 



amino acids 75-79, 322-326 




N-myristoylation site. 



Growth factor and cytokines receptors family. 



amino acids 184-154 



amino acids 134-150 
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FIGURE 23 



P 
0 

m 



GGCACAGC CGCGCGGCGGAGGGCAGAGTCAGCC GAGCCGAGTC CAGCCGGACGAGCGGAC CAG CGCAGG GCAG CCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAAC GAT GC C CCC CTC C ACGAGAT CAACGGT GATCATTT GAAGAT CTGT C CCCAGGGT T CTACCTGCTGC T CT 
CAAGAGAT GGAGGAGAAGTACAGCCT GCAAAGT AAAGATGATTT CAAAAGT GTGGT CAGCGAACAGTGCAATCAT T TG 
CAAGCT GT CT TT G CT T CACGTTACAAGAAGTTT GAT GAATT CTT CAAAGAACTACTT GAAAAT GCAGAGAAATCCCTG 
AATGATAT GTTTGTGAAGACAT ATGGCCATT TAT ACATGCAAAATTCT GAGCTAT TT AAAGATCT CTT CGTAGAGTTG 



Mi AAACGT TACTACGTGGTGGGAAATGT GAAC CTGGAAGAAAT GCTAAAT GACTTCT GGGCT CGCCT CCT GGAGCGGATG 

If 1 TTCCGCCTGGTGAACT CCCAGT ACCACTT T ACAGAT GAGTATCT GGAATGT GTGAGCAAGT ATACGGAGCAGCT GAAG 

III CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
III TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
£ ATGATCTACTGCTCCCACTGCCGGGGTCT CGTGACT GTGAAGC CATGTTACAACTACT G CT CAAACAT CATGAGAGGC 

T GTTT GGC CAACCAAGGGGAT CT CGATTT T GAAT GGAACAATTT CATAGAT GCTAT GCTGAT G GT GGCAGAGAG GCTA 
III GAGG GTCCT TTCAACATTGAAT C GGT CAT GGAT CCC ATC GATGT GAAGATTTCT GAT GCTATTAT GAACAT GCAGGAT 

M AATAGTGTT CAAGTGT CTCAGAAGGTTTT CCAGG GATGT GGAC C CCCCAAGCCCCT CC CAGCTGGACGAAT TTCT C GT 

111 TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACA^ 

Id AGTTT GGACCGACTGGTTACTGAT GT CAAGGAGAAACTGAAACAGGCCAAGAAATTCT GGT CCT CCCT T C C GAG CAAC 

I'll GTTT GCAAC GATGAGAGGATGGCT GCAGGAAACGGC AAT GAGGATGACTGTTGGAAT GGGAAAGGCAAAAGCAGGT AC 

CTGTT T GCAGTGACAGGAAATGGATTAGCCAACCAGGGGAACAACCGAGAGGTC CAGGT TG^ 

ATACTGATCCTTCGTCAAATCAT GGCTCTT CGAGTGATGACCAGCAAGATGAAGAATGCATACAATGGGAACGACGTG 
GACTT CTTT GATATCAGTGATGAAAGTAGT G GAGAAGGAAGTGGAAGT GGCTGT GAGTATCAGCAGTGCCCTT CAGAG 
T TTGACTACAATGCCACTGACCAT GCTGGGAAGAGT GCCAATGAGAAAGC CGACAGT GCT GGTGTCCGT CCT GGGGCA 
CAGGCCTACCTCCTCACTGTCTT CTGCAT CTTGTTCCTGGTTAT GCAGAGAGAGTGGAGATAATTCTCAAACTCT GAG 
AAAAAGTGTTCAT CAAAAAG TT AAAAGGCACCAGTTATCACTTT TCTACCATCCTAGT GACTTT GCTTTTTAAAT GAA 
T GGACAACAATGTACAGTTTT T ACTATGT GGCC ACT GGTT T AAGAAGT GCT GACTTT GT TTTCT CATT CAGTTTT GGG 
AGGAAAAGGGACT GT GCATT GAGT TGGTT C CTGCTC CCC CAAACCATGTT AAACGTGGCTAACAGT GTAGGTACAGAA 
CTATAGTTAGTTGT GCATTT GT GATTTTAT CACT CTAT TATTT GT TTGTAT GTTTTTTT CT CAT TTCGTT T GT G GGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CTGAAATATTAAAT AGCT GTACAGAAGCAGGT TTTATTTATCAT GTT AT CTTATTAAAAGAAAAAGC C CAAAAAGC 
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FIGURE 24 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFW^ 

QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEW 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
E KLKQAKKFWS S LPSNVCNDERMAAGNGNEDDCWNGKGKSRYL FAVT GNGLANQGNNPE VQVDT S 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLL TVFC I LFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) ♦ 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 



AppJD=10063538 



Page 176 



FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGC TT C TAGGAGAT GGC TCCAGGAAGGCGGCCAAGAATG TGAGTGCAAAGAT TGGT TCCT GAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
T CAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATT CCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTTCCAATTAAACAT TCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACAGT C 
p! TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
gt AGCTAGTGTCAT T TAACCT TAAATGCAATCAGGAAAGTAGCAAACAGAAG T CAATAAATATT TT T 

E AAATGTCAAAAAAAAAAARAAAAAA 

fair 
fll 

fll 
0 

m 



5f: 
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FIGURE 26 



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKF'M 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites, 

amino acids 27-33/ 46-52 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCC ATG GACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
T GGCCGTGCT GACT CCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGC TCT T CAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AG T T CC T GACAAAGAG CAT GGC T GAGAACAGGCAC C T C CAATAT GAG CGG TTTGTGGTGGCT CC T 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
c T TCCCCAGCC T CCAATTAGAACAAGCCACC CACCAGCCTATCTATCT TCCACTGAGAGGGAC CTA 

Cl GCAGAATGAGAGAAGACAT TCAT GT ACCAC CTAC TAGTCC CTC TCTC CCCAACC TCTGCCAGGGC 

AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 

AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 

ill 

m CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
fll CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACT CAAAGAGGCAGAGG T TT TG T T CT CAAATAT T TT T T AATAAATAGACGAA 
ACCACG 



f : H 
f J? 

I! 



ill 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLSLGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEH^ 

KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQP I YLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCT TATT TAAA. 
GTGGT TGT T TATGAT T CT TATACTAAT T TATACAAAGATAT TAAGGCCCTGT TCATTAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAA&AAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 



Growth factor and cytokines receptors family: 



n amino acids 3-18 



5s? s 



III 
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FIGURE n 



Ml 

m 



II! 
r? 

HI 

m 
rii 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCC T TGGCC T GTGTATCC TCGCATTAGCC TTGTCT T T GGCCAT GATGTT TACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
T GT CT GCGGTG T TT TATGGTGGCTGTAT TATGAC TATACCAACGACC TCAGCATAGAAT TGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AAT TGGCC TCAT CT GGACTAGTGAATT CATCCT TGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT TCCGAGAAT CAT T GTCATGT ACAT GCAAAACGCACT GAAAGAACAGCAGCATGGTGCAT TGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATGCATATACTACAACTGC TAT TAAT GGGACAGATT T C TGTACAT CAGCAAAAGATGC 
AT T CAAAATC T TGT CC AAGAAC T CAAGT CAC T T TACAT C T AT TAAC T GC T T T GGAGAC T T CAT AA 
T TT TT CTAGGAAAGG TGT TAG TGGT GT G T T TCAC TGT TT T T GGAGGACT CAT GGC TTT TAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAAC AAAT TAAACAAT GCAAGGGCACAGCAGGACAAGCACT CAT TAAGGAAT GAGGA 
GGGAACAGAACT CCAGGCCAT TGTGAG ATAGA TACCCATT TAGGTATCTGTACCTGGAAAACATT 
T CCT TCTAAGAGCCAT TTACAGAATAGAAGATGAGACCACTAGAGAAAAG T TAG T GAAT T T T TTT 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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Hi 



FIGURE 32 



MSGRDTILGLCILALALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
I L I FFWVLWVAVLLS LG TAGAAQVME GGQVE YKPLS G IRYMWS YHL IGLIWTSEFI LACQQMT I A 
GAWTCYFNRSKNDPPDHP ILS SLS I LFFYHQGTWKGS FLISWRIPRI IVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQWAVPLLLVAFFAYLVAHS FLSVFE TVLDAL FLC FA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 



P 

g*l Important features: 

0! Signal peptide: 



amino acids 1-20 



g'*\ Putative transmembrane domains: 

r amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

Ill N-glycosylation sites. 

!!! amino acids 204-208, 295-299, 313-317 



N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 3 ft 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATT TG TATGGGATT T GTGATGCAGGAAAGCC TAAGGGAAAAAGAAT ATTCAT TCT GTG T GGT 
GAAAAT T T TTTGAAAAAAAAAT T GCC T TC T TCAAACAAGGGTG T CATT CT GATAT TTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT T CAAACAAAGAAACGGCAAAGAAGAT TAAAAGGC CCAAGT TCACTG TGC CTCAGAT CAACTGC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGT TACAAAGGGAGT TAT TCCAACGG TGT C CAATCGT TAT CCC TACCACGATGGAGAGAATCCT T 
TAT CGTCT TAGAAAG TAAACC CAAAAAGGG TG TAACC TACCCAT CAGCTCT TACATAC TCAT CAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCC TATCAGAGGCCACCTAT TCCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
C AC C T T GC CAAG GC CAT CCCCTTCTGCTGCTTC T AC CACC AGC AT CC C C AGACC AC AAT C AG TG G 
W GCCACAGGAGCCAGGAGATGGAT CT CT GGTCCAC TGCCAC C TACACAAGCAGCCAAAACAGGCC C 

01 AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
ill GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 
TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 
CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 



□ 



III 

III 

m 



rf? CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 



III 



p 
m 



ACACGAAT TC TCGAGATC TGAAGACAGCCATAGAGAAAAT TAC T CAGAGAGGAGGACT T T CT AAT 
GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 
AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
||1 ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGT TT GAGAT TT CCGACACGGACACGCGCATCGGGGCC G TGCAGTACACCTACGAACAGCGGC 
T GGAGTT TGGG T TC GACAAGTACAGCAGCAAGCC TGAC AT CCT CAAC GCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGT TAATGATCCTCAT CACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GTTTGACAACCTCCATCAGTATGTCCCCAGGATCAT CCAGAACATT TGT AC AGAGT TCAACT CAC 
AGCCTCGGAAC TGAA TTCAGAGCAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGATCACAAACGTATAGAATGAGCCAAAAGGC TACATCAT G T TGAGGG TGCTGGAGAT T T TACAT 
T T T GACAAT TG T TT T CAAAATAAAT G T T CGGAATACAGTGCAGCCCT TACGACAGGC T TACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
TTTTGTCATGACAATGTAGGAATTGCTGAATTAAATGTTTAGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLS FLIDGSTS IGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNVVVVIOTDGWPTDKVEEASRLARESGINIFFITI^ 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI L I TDGRSYDDVRI PAMAAHLKGVI TYAI GVAWAAQEELEVI ATHPARDHS FF 
VDEFDNLHQYVPRI IQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 3* 



M 



CCGAGC AC AGGAGATT GC CTGCGT TT AGGAGGTGGCT GCGTTGT GGGAAAAGCT AT CAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCT T T AATTTC AT CT GGAATTCC AC AGTTT TCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTG GATGG CCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGAT GT CACTGAGATCCC TCAAAT GGAGCCTCCTGCTGCT GTCACTCCTGAGT T TCTTTGTGATGTGGTAC 
CTCAGCCTT CCCCACT AC AATGTGAT AGAACGCGTGAACT GG ATGT ACTT CT AT GAGT AT GAGCCG ATTT A 
C AGACAAG ACTT TC ACT T C ACACT TC GAGAGC ATTCAAACTGCT CTCATC AAA&TCCATT TC TGGTCATT C 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
TGGT GGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCT GAAAAGGAAGACAAAATGTTGGC 
|fl ATT GTCCTT AGAGGAT GAAC ACCT TCT T T ATGGTGACAT AATCCGACAAGAT TTTT TAGACAC AT ATAAT A 

III ACCT GACCTTGAAAAC C ATT ATGGCAT T C AGGTGGGTAAC T G AGTTTT GC CC C AATG CCAAGT ACGT AAT G 

m 

W S AAGACAGAC ACTGATGTT TTCATC AATACTGGCAAT TT AGT GAAGT AT CTTTT AAACCTAAACCACTCAGA 

US 

GAAGTT T TTC AC AGGT T ATCCTCT AAT T G ATAAT T ATT CCT AT AGAGGAT TTTACCAAAAAAC CC AT ATTT 

H\ ■ 

-~ CTTACC AGGAGT ATCCTT TC AAGGTGT T CCCTCC AT ACTGC AGTGGGT TGGGT T AT ATAATGT CCAGAGAT 

j|U TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 
1 1| CTGT TT GAATTT ATTAAAAGTGAACATT C ATATTCC AGAAGAC ACAAATCTTT T CT T TCT AT ATAG AATCC 

P| ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
jrfj TGGCAGGTCATGCTAAGGAACACCACATGCCATTAT TAA CTTCACATTCTACAAAAAGCCTAGAAGGACAG 
|J| GAT ACCT TGTGGAAAGTGTTAAATAAAGT AGGT ACT GT GGAAAATTCATGGGGAGG T CAGTGT GCTGGCTT 
f|| AC ACTGAACT GAAACT C ATGAAAAACCCAGACTGGAGACTGG AGGGTT AC ACTT GTG ATT TAT TAGTC AGG 

CCCT TC AAAGATGATATGTGGAGGAATT AAAT AT AAAGGAAT TGGAGGTT T TTG CT AAAGAAATT AATAGG 
ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CTAGGCT GTAAAAACAAAAC AATGTAGAGTTTT ATT TATT GAACAATGTAGTCACT TGAAGGTTTTGTGT A 
TAT C TT ATGTGG ATTACCAATT T AAAAAT ATATGTAGT T C TGTGTCAAAAAACTTCTTCACT GAAGTT AT A 
C T GAAC AAAATT T TACCT GTTT TT GG TC ATTT AT AAAGTACT TCAAGATGTTGC AGTATT TCACAGTTAT T 
AT TATT T AAAAT T ACT T CAACT TT GT GT TTTT AAAT GT T TTGACGAT T TCAAT ACAAGAT AAAAAGGAT AG 
TGAATC ATTCTT T ACAT GC AAACATT TT CC AG TTACTT AACTGATC AGTT TAT T ATTG AT ACATC ACTCC A 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AATATAGAGAAG AATT AAAGCAAGAAAAT CTG AAAA 
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FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLLL^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQATRVTWGEKKSTOGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVWGICLNLLKWIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 27 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACT1TTCTTCAGCTCCTTCTCATC 
TC GT C CT T GCCAAGAGAGT AC AC AGTCAT TAATGAAGCCTGCCCTGGAGCAGAGTGGAAT ATCATGTGTCGGGAGTGC TGTG 
AATAT GAT CAGATTGAGTGCGTC T GCCCCGGAAAGAGGGAAGTCGTGGGTTATACC ATCCC TTGCTGCAGGAATGAGGAGAA 
T GAGTGT GAC TC CTGC C TGATCC AC CCAGGT TGTACCATCTTTGAAAAC TGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACC T T GGAT GAC TTC TATGTGAAGGGGT TCTAC TGTGCAGAGTGC CGAGCAGGC TGGTACGGAGGAGACTGCATGCGATGTG 
GCCAGGT T C TGCGAGCCCC AAAGGGTCAGAT TT TGTTGGAAAGC TAT CC CC TAAAT GCTC ACTGTGAATGGACCATTCATGC 
TAAAC CT GGGTT TGTCATCCAAC TAAGAT TTGTCATGTTGAGTCTGGAGTT TGAC T ACATGTGCCAGTATGACTATGTTGAG 
GTT C GTGAT GGAGAC AACCGCGATGGCCAGATCATCAAGCGTGTC TGTGGCAACGAGCGGCCAGCTCC TAT CCAGAGCATAG 
s . GATCCTC^CTCC^CGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATG^CATTTATGAGGAGATCACAGC 
^ ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 

'HI TATACTGGGCAGCGC TGTGAAAATCTCC T TGAAGAAAGAAACTGC TC AGAC C C TGGGGGCCCAGTCAATGGGTAC CAGAAAA 

I A 

f J % TAACAGGGGGCC C TGGGCT TATC AACGGACGCCAT GC TAAAAT TGGC AC CGT GGTGTC T T TCTTTTGTAACAACTCCTATGT 

j ! * ! TCT TAGT GGCAAT GAGAAAAGAACT TGCCAGCAGAATGGAGAGTGGTC AGGGAAACAGCCCATCTGCATAAA^ 

* GAACCAAAGATTTCAGACC TGGTGAGAAGGAGAGTTCTTCCGATGCAGGTTCAGTCAAGGGAGACACCATTACACCAGC TAT 

ACTCAGC GGCC T T CAGCAAGC AGAAACTGCAGAGT GCCCCTACCAAGAAGC C AGCCC TTCCCTTTGGAGATC TGCCCATGGG 
J;.; ATACCAACATCTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 
C TGAGGAC TGGGAAGT GGAGTGGGCGGGCACCATC C TGCATCCCTATC TGC GGGAAAAT TGAGAACATCAC T GC TCCAAAGA 
Si CCCAAGGGTTGC GC TGGCCGTGGCAGGCAGCCATC TACAGGAGGACC AGCGGGGTGCATGACGGCAGCCTACACAAGGGAGC 

M GTGGT TC C TAGT CTGCAGCGGTGCCCTGGTGAATGAGCGC AC TGTGGTGGT GGC TGCCCAC TGTGTTACTGAC CTGGGGAAG 

II 1 GTCAC CATGATCAAGACAGCAGACC TGAAAGTTGT T T TGGGGAAATTC T AC C GGGATGATGACCGGGATGAGAAGACCATCC 

III AGAGCCTACAGATT T C T GC TATCATTCTGCATCCCAACTATGACCCC ATCC T GC T TGATGC TGACATCGCCAT CC TGAAGC T 

HI CCTAGACAAGGC C CGTATCAGCACCCGAGTCCAGC CCATC TGCCTCGC TGC C AGTCGGGATCTCAGCACTTC CTTCCAGGAG 

|3 TCCCACATCAC T GTGGC TGGC TGGAATGT CC TGGCAGACGTGAGGAGCC CT GGC T T CAAGAACGACACACTGC GCTCTGGGG 

||J TGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGCAGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTT 
C TGTGCCAGCTGGGAAC CC AC TGCCCC TT C TGATATC TGCACT GCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGCATCTCCT GAGC CACGC TGGC ATCT GATGGGAC TGGTCAGCTGGAGC TATGAT AAAACATGCAGCCACAGGCTCT C C A 
CTGCCTTCACCAAGGTGCTGCCT T T TAAAGACTGGAT TGAAAGAAAT AT GAA AT GAA CCATGCTCATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGC CAGGGC TT CTGAC TTCAGGGACAAAAC TCAGTGAAGGGTGAGT AGACC TCC AT TGCTGGTAGGC TGATGCC GCGT CC A 
C TAG TAGGACAGC CAAT TGGAAGATGCCAGGGC TTGCAAGAAGTAAGTT TC T TC AAAGAAGACCATATACAAAACCTC T C C A 
C TC CACT GACCTGGT GGTC T TCCCCAAC T T TCAGTTATACGAATGCCAT CAGC TTGACCAGGGAAGATCTGGGCTTCAT GAG 
GCCCCTTTT GAGGC TC TCAAGTTC TAGAGAGC TGCCTGTGGGACAGC C CAGGGCAGC AGAGCTGGGATGTGGTGCATGCC T T 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQISCVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ ILLES YPLNAHCEWT I HAKPGFVI QLR FVMLS LE FDYMCQYD YVE VRDGDNRDGQII 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR 
DLSTSFQESHITVAGWNVLADTO 

SWEPTAPSDICTAETGGIAAVS FPGRASPEPRWHLMGLVSWS YDKTCSHRLS TAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAA.TCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTT CTCAGTCAAATTCTTTGT GATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTT GGCT GG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
TGGTCTTGCCTT GGCT CAGTCCT GCTAACTACATTGACAATGT GGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
T C ACAGCCACGGCTC C CTCCCCAGAGGTTT CT GCAGCTGCCAC CATCT CCTTAAT GACAGACGAGCCT GGC CTAGACA 
ACCCT GCCTACGTGT C CTCG GCAGAGGACGGGCAGCCAGCAAT CAGC C CAGTGGACT CTGGC CG GAGCAAC CGAACTA 
GGGCACGGCCCTT TGAGAGATC CACT ATTAGAAGCAGATCATTTAAAAAAATAAATC GAG CTTT GAGT GTT CTTC GAA 
|xi GGACAAAGAGCGGGAGT GCAGTT GCCAACCATGCC GACCAGGGCAGGGAAAATTCTGAAAAC AC CACT GCC CCTGAAG 

p T CTTT CCAAGGTT GT ACCACCT GATT CCAGATGGT GAAAT TAC CAGCATCAAGAT CAATC GAGTAGAT CCCAGTGAAA 

|!*f GCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAACCCCACT GGTCCATATCATTAT CCAACACATTTAT CGTGATGGGG 

pi T GATCGC CAGAGACGGCCGGCT ACTGCCAGGAGACAT CAT T CTAAAGGTCAAC GGGAT GGACATCAGCAAT GTCCC TC 

III ACAACTAC GCT GT GCGT CT CCT GCGGCAGC CCT GCCAGGT GCT GTGGCTGACT GT GAT GC GT GAACAGAAGT TCCGCA 

111 GCAGGAACAATGGACAGGCCCCGGAT GCCTACAGACCCCGAGAT GACAGCT TT CATGT GAT T CT CAACAAAAGTAGCC 

|l| CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
ft# GT G TGGCAT ATC GAC ATGGTCAGCTT GAGGAGAATGACCGT GT GTTAGCCATCAATGGACAT GAT CTT CGAT ATGGCA 

P GCCCAGAAAGTGCGGCTCATCT GATT CAGGCCAGTGAAAGACGTGTT CAC CTC GT CGT GTCCCGCCAGGTTCGGCAGC 

III GGAGCCCT GACATCTT TCAGGAAGCC GGCT GGAACAGCAAT GGCAG CT GGT CCCCAGGGCCAGGGGAGAGGAGCAACA 

CT CCCAAGCCCCTCCATCCTACAATTACTT GTCATGAGAAGGT GGTAAATATC CAAAAAGACCCCGGT GAAT CTCT CG 
C| GCATGACC GT CGCAGGGGGAGCATCACATAGAGAAT G GGATTT GCCTATCTAT GT CAT CAGT G TT GAGCCCGGAGGAG 

I II T CATAAGCAGAGATG GAAGAATAAAAACAGGTGACATTTT GTT GAAT GTGGAT GGGGT CGAACT GACAGAGGTCAGCC 

| ¥ | GGAGT GAGGCAGTGGCATTATT GAAAAGAACATCATCCTCGATAGTACTCAAAGCTTT GGAAGTCAAAGAGTATGAGC 

!|| C C CAGGAAGACT GCAGCAGCCCAGCAGCCC TGGACT CCAACCACAACATGGCCCCAC CCAGT GACTGGTCC CCAT CCT 

GGGTCATGT GGCT GGAATTACCACGGTGCTTGTATAACTGTAAAGATATT GTAT T ACGAAGAAACACAGCT GGAAGTC 
T GGGCTTCT GCAT TGTAGGAGGT TAT GAAGAATACAATGGAAACAAACCTT TTTT CAT CAAATCCATT GTT GAAGGAA 
CACCAGCAT ACAATGATGGAAGAATTAGAT GTGGT GATAT T CTT CTT GCT GTCAATGGTAGAAGT ACATCAGGAAT GA 
T ACAT GCTT GCTT GGC AAGACT GCTGAAAGAACTTAAAGGAAGAATTACT CTAACTATTGT TTCT TGGC CT GGCACTT 
TTTTATAGAATCAAT GAT GGGT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACTATATTTATC 
TTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGGAAAAGTATGATCATCTAATGAAAGCCAGTT 
ACACCT CAGAAAATAT GATT CCAAAAAAAT TAAAACTACT AGTTTTTTTT CAGTGTGGAG GATTT CTCATT ACTC T AC 
AACAT TGTTTATATTTTTT CTAT TCAATAAAAAGC C CTAAAACAACTAAAATGATT GATTT GTAT ACC C CACTGAATT 
CAAGCTGATTTAAATTTAAAATTTGGTATATGCTGAAGTCTGCCAAGGGTACATTATGGCCATTTTTAATTTACAGCT 
AAAAT ATT TTTT AAAATGCATT GCT GAGAAACGTT GCT TT CAT CAAACAAGAATAAAT ATTT TT CAGAAGT T AAA 



It 
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FIGURE ao 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFfCKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
RE QKFRS RNNGQAPDAYRPRDDS FHVI LNKSS PEEQLGI KLVRKVDEPGVFI FNVL DGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLOTDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTT GGAAGAGGTACAGGGAGAAAAT CCCAAAGT GT CCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTT TACAGAGGGTCGC CATC CTCGT TCCC CACCGGAACAGAGAGAAACACCTGATGTACCTGCT GGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTGAAGGAAGAAAATTGGG 
ACT GCTT TATATTCCACGAT GTGGACCTGGTAC CCGAGAATGACTTTAACC T TTACAAGTGTGAGGAG 



ffi 

jsr- 
ill 

| J] CAT CCCAAG CAT CTGGTGGT T GG CAGGAACAG C ACT G GGT ACAGGT T AC GT T ACAGT GGATAT T T TGG 

111 GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
111 G GG GAGG C GAAGACGAT GAC CTCAGAC TC AG G G T T GAGC T CCAAAGAAT GAAAAT T TCC C GGC CCCT G 

* C CT GAAGTGGGTAAATATAC AATGGTCTT CCACACTAGAGACAAAGGCAAT GAGGT GAACGCAGAACG 

GAT GAAGCT CT TACACCAAGTGTCACGAGTCTGGAGAACAGAT GGGT TGAGTAGTTGTTCTTATAAAT 
TAGTATCTGTGGAACACAAT CCTT TATAT ATCAACAT CACAGT GGAT TT CT GGTTT GGT GCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 



Q ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
f|| TAGCAGAGCTC CTGGT GAT GTAGAGTATAAAACAGT T GTAACAAGACAGCT TTCTTAGTCATTTTGAT 

CAT GAGGGTTAAATAT TGTAATAT GGATACTT GAAGGACT TTATATAAAAGGATGACTCAAAGGATAA 
AAT GAACGCTATT TGAGGACTCTGGTT GAAGGAGAT T TAT TTAAATT T GAAGTAATATAT TATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACT GTTATTCAT TTAT CCT GTACAATCATCT GTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGT GAACTTGGGAAT GAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
C TT CAGGGGAGGACCT GCC CAGGTATGCC TTCCAGT GATGCCCACCAGAGAATACATTCTCTATTAGT 
TTT TAAAGAGT TTTTGTAAAATGATTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACAT 
ATT AACT AATAATAAATATGTCTATCAAATACC TCT GTAGTAAAATGT GAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
I^EKHIMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFimAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINI TVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCT6GG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACT T GCAGAGC TGCAACCCCAGGACAGAGC TGGAGCCAGGGCCAGC T GGAT GCC CAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAA7^AAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE ±± 



p 



in 



t,2 : 



ru 



MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-2 4 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 



Prokaryotic membrane lipoprotein lipid attachment site. 

£3 amino acids 1-12 
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FIGURE ax 



GTGGCT TCAT T T CAGTGGC TGACT T CCAGAGAGCAA.TATGGCTGGT T CCCCAACATGCCT CACC C 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACT T CCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAATGACT CAGGGATCTAC TATGT GGGGATATACAGCT CATCAC T CCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT 
SJ TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
rf'i CTGGAGAT GGGGAGAAAGTGATAT GACC T TCATC TGCGTT GCCAGGAACCCTGTCAGCAGAAAC T 

III TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
G T T T C T G AAG AG AG AG AG AC AAG AAG AG T ACA T T G AAG AG AAGAAGAG AG T G GAC AT TTGTCGGG 

Ml 

AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
p AGAACAAT CCTAAAGGAAGATCCAGCAAATACGGT T TACT CCACTGT GGAAATACCGAAAAAGAT 

If] GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 



III 



fll 



TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE a6 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 

EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 

PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVI YTWKMjGQAANE SHNGS I LP I S WRWGE SDMT FI C 

VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLK^ 

EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 

PRLFAYENVI 



r* Important features: 

~J; Signal peptide: 

f$l amino acids 1-22 

t s 
r:: 

Ill Transmembrane domain: 

W amino acids 224-250 

fill 



ft 
f!J 



Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE ah 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGT TGAGGAAGACCAAT T TTC T CAAAACC CCAT CT CT TGC T T TGAGTGG TGGT TCCCAGGAAT T 
ATAGGAGCAGG T CT GATGGCCAT TCCAGCAACAACAAT G T CCT T GACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTATU^lAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT TGTGAAT TTT CAT TGAAAAACATCAGTGACAT TCATCCAGAATCCTT CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGT GGC TGGAGAGCAT CTAGT T TCCACT T CGAT TCT GAAGAAAACAAACAT AGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TGTAGT TTAATGGGAATAAAATGTAAGTATCAGTAGT TTGAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
T TMS LTARKRACCNNRTGMFL S S FFS VI T VIGAL YCML I S I QALLKGPLMCNS P SNS NANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

^ amino acids 10-31 (type II), 50-72, 87-110, 191-213 

gfl N-glycosylation sites. 

[{I amino acids 80-84, 132-136, 148-152, 163-167 

Si 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

|j| N-myristoylation sites. 

Q amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Ill 

f! 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAAC CTGCAGAGT CACACAGT TGC TGACCAAT TGAGC TGTGAGCC TGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
C ATGG AGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATT TGCC AATAAAGACGATCCCT T C TAC TAT GAC T GGAAAAACC T GCAGC TGAGCGGACTGAT C 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
01 TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 



III 
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FIGURE so 



MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
S QKQHS PVPEKAI PL I T PGS AT TC 

Important features: 
Signal peptide: 

amino acids 1-16 



Transmembrane domain: 



amino acids 36-59 




N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 



amino acids 54-67 
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FIGURE 51 



ft 

M B 

t r. 
~: = ~ 

III 



GTGGAC TCTGAGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACAC T C TGGAGAGAGAGGGGGC TGGGCAGAGATGAAGT TC CAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

III 

AGC CAGCAACCAGAAT GAAGGGTGCACGAATCCC CCACCATCTGGC T CAGGTGGAGGC TGCAGCA 
P| ACT CTGGGGGAGGCAGCGGCTCACAG TCGGGCAGCAG T GGCAG T GGCAGCAATGGTGACAACAAC 

|R AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
P TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

LI 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T T TAAATC CAAGC TGGGT T TCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGC T CT C 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAA?^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



m 
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FIGURE 52 



P 

13 



MKFQGPIACLLIJUjCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVI RHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
S SGSS SGS S SGGS SGGS SGGS SGNSGGSRGDS GSE S S WGS S TGS S SGNHGGS GGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISBCEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 

amino acids 1-21 



p| N-glycosylation site. 



amino acids 265-269 
Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGAC^GAAGG ATG TCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CAC AGCCC CCAAAACG GAAC T GG T T T T GGG GT CACC T GG G C C T GAT C AC T C C TACAGAGGAGG G C 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
CCT CAGAGGGCAGCAGT CG TC T GGACATGT T TGAGCACAT CAGCC T CATGACCTTGGACAG T CT A 
CAGAAATGCATCTT CAGCT TTGACAGCCAT TGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT 
CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
CAT TGTCAGATGAGGATATAAGAGC AGAGGCTGACACCT T CAT GT T T GGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 

MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARILAW 

LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRI^LTPAFHFNILKSYITIFI^Si^IMLDKWQHLASEGSSRLDMFEHr 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFI SRCCTQDIVLPDGRVI PKGI TCL IDI I GVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
LAFI PFSAGPRNC I GQAFAMAEMKVVLALMLLHFRFLPDHTE PRRKLELIMRAEGGLWLRVEPLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme- iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 



til 

w 



III 

w 



m 

II! 



i ? •? 
fll 



ATCGCAT CAAT T GGGAGTACCATCT TCC T CATGGGACCAGTGAAACAGCTGAAGCGAATGTT TGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT T CATGGCCAG TT TTATGAAGC T T TGGAAGGCACTATGGACAGAAGCTGGTGGACAGT TT T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATT CGAACAT T TGAGGGT TACT T T TGGAAGCAACAATACAT TCT CGAACCT GAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



AppJD==10063538 



Page 207 of 320 



FIGURE 56 



MGPVKQLKRMFEPTRL IATIMVLLCFALTLCSAFWWHNKGLALIFCILQSLALTWYSLSFIPFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 



tp amino acids 30-72 

i 

III 
I s'-x 



HI 



0 

fit 
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FIGURE S7 

CGGCTCGAGCTCGAGCCGAATCGGCT CGAGGGGCAGT GGAGCACCCAGCAGGCCGCCAACA^TCCT CTGT CTGTGCCTG 
T ACGT GCC GGT CATCGGGGAAGC CCAGACC GAGT TCCAGTACT TTGAGTC GAAGGGGCT C CCTGCCGAGCT GAAGT CC 
ATTTT CAAGCTCAGT GTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGT GGAAGCAGAAAAT T GTACAAGCT 
GGAGATAAGGAC CTT GATG GGCAGCTAGACTTT GAAGAATTTGT CCATTATCT CCAAGAT CAT GAGAAGAAGCTGAGG 
CT GGT GTTTAAGATTTTGGACAAAAAGAAT GATGGACGCATTGACGCGCAGGAGATCATGCAGT CCCTGCGGGACTTG 
GGAGT CAAGATATCT GAACAGCAGGC AGAAAAAATTCTCAAGAGCAT GGAT AAAAAC G GCAC GAT GACCATC GAC T GG 
AACGAGTGGAGAGACT AC CACCTCCTCCACCC C GTGGAAAACAT CCCC GAGATCATCCT CTACT GGAAGCATTC CACG 
AT CTT TGAT GTGGGT GAGAATCTAACGGT CCCGGATGAGTTCACAGT GGAGGAGAGGCAGACGGGGAT GTGGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCAT GCCTC CCGCAGCAACIAACAT GGGCAT CGTT GGT GGCTT CACTCAGATGATTCGAGAAGGAGGGGCCAGG 
TCACT CTGGCGGG GCAATGGCAT CAACGT C CTCAAAAT TGCCC C CGAATCAGC CATCAAATT CAT GGC CTAT GAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
AT CGC CCAGAGCAGCATCT ACCCAAT GGAGGTC CT GAAGACC CGGAT GGC GCT GCGGAAGACAGGCCAGTACT CAGGA 
AT GCT GGACTGC GC CAGGAGGAT CCTGGC CAGAGAGGGGGTGGCCGCCTT CTACAAAGGCTATGT CC CCAACAT GCTG 
GG CAT CAT CCCCTAT GCCGGCAT CGACCTT GCAGTCTACGAGAC GCT CAAGAAT GCCT GGCT GCAGCACTAT GCAGT G 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCC^GGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAAT CCATGAT GAAAGGT GAGGTCACGTGGC CT CC CAGGC CT GACTT CCCAACCTACAGCATT GACGCCAACTT GGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
CCATGTTT GAGGGCGAAGGGCAGAGCGTTT GT GTGT T CTGGGGAGG GAAGGAAAAGGT GTT GGAGGCCTTAAT TAT GG 
ACTGT T GGGAAAAGGGT TTTGT C CAGAAGGACAAGC CGGACAAATGAGCGACTTCTGT GCT T CCAGAG GAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGACClAGCCCC^iCATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 
AACTATTT TTATAGATTT GT TT AATTAATAGCTT GT CATTTT C AAGTT CATTTTTTATTCATATT TAT GTT CATGGTT 
GAT TGTAC CTT CCCAAGCCCGC CCAGT GGGATGGGAGGAG GAGGAGAAGGGGGGC CTT GGGCCGCTGCAGT CACAT CT 
GTCC^GAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GT TGGGGAAGGGCTT GCC CCCAGCCT TAGGATT T CAGGGTT TGACTGGGGGCGT GGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAAC^ATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
TGAGGAAATCTT CAAT AGGAT GCAAAGAT CAAT GCAAAAATTGTTATATAT GAACAT ATAACTGGAGT C GT CAAAAAG 
CAAAT TAAGAAAGAATT GGAC GTTAGAAGTT GT CAT TTAAAGCAGCCTTCTAATAAAGTTGTTT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 58 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKS I FKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHP VENI PE 1 1 LYWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNIS^^ 

FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSG 
I LAREGVAAFYKGYVPNMLG 1 1 P YAG I DLAVYE TLKNAWLQH YAVNS ADPGVFVLLAC GTMS S T C 
GQLASYPLALVRTRMQAQAS IEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAVS ISYV 
VYENLKI TLGVQSR 




Important features: 
Signal peptide: 

amino acids 1-16 



Putative transmembrane domains: 



amino acids 284-304, 339-360, 376-394 



Mitochondrial energy transfer proteins signature. 



amino acids 206-215, 300-309 



N-glycosylation sites. 

amino acids 129-133, 169-173 



Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 



GGAAGGCAGCGGCAGCTCCAC TCAGCCAGTACCCAGATACGC T GGGAAC C T T CCCCAGCCATGGC 
T TCCCTGGGGCAGATCC TCT TCTGGAGCATAAT TAGCATCATCATTAT TCTGGCTGGAGCAAT TG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAT 
CGT GATACAAT GGC TGAAGGAAGGT G TT T TAGGCT T GGTCCAT GAGT TCAAAGAAGGCAAAGAT G 
AGCTGTCGGAGCAGGATGAAATGT T CAGAGGCCGGACAGCAGT GTT T GCTGAT CAAGT GATAGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 

CAT CAC T T C T AAAGG C AAG GG GAAT G CT AACC T TGAG T AT AAAAC T GGAGC CT T CAGCAT G C CG G 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATACT CCTGTATGAT T GAAAAT GACATTGCCAAAGCAACAGGGGATAT CAAAGT G 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 
CACCAGATATGACCTAGTTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTG 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
CTATCT TCAAAGACATAT TAGAAGTT GGGAAAATAAT T CAT GT GAACTAGACAAGTGTGTTAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 

T GTAATGTTGCT CTGAGGAAGCCCCTGGAAAGTC TATCCCAACATATCCACATCT TATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T CTTCCCAAC T GACAAATGC CAAAGT T GAGAAAAAT GATCATAAT T T T AGCATAAACAGAGCAGT 
CGGGGACACCGATT T T ATAAATAAACT GAGCACC TT CT TT T T AAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVOTDYNASSETLRCEAPRWFPQPTVVWASQVDQGANFSEVS 
NTSFELNSEWTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVSSFFAISWALLPLSPYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites* 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAAT CCTGGGATGT T CCCCTCTGGAACTCCAGGAGGACCATAT G 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
AT T GGTCT TCATT CAATGATGAGACCTGCCTCATGATGAT AAACATGTT T GACAAGACCAAGT CA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCGACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCC TTCC GGGAGAAGGACACAGCT GTACAAGGCAACAT C CGGCTCAGCTT CGAGGACTT CGTCA 
C CATGACAGC T TCTCGGATGC TATGACCCAACCATC TGTGGAGAGTGGAGT GCACCAGGGAC CT T 
T CCTGGC TTC T TAGAGTGAGAGAAGTAT GTGGACAT CT CT TCT TT TC CT G T CCC T C TAGAAGAAC 
ATTC TCCC TTGCTT GATGCAACACT GTTCCAAAAGAGGGTGGAGAGT CCTGCATCATAGCCACCA 
AATAGT GAGGAC CGGGGCTGAGGCC ACACAGATAGGGGCCT GATGGAGGAGAGGATAGAAG T TGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCAT GAGAGAGATC T GT CTATGGGACCAGT GGC T TGGAT TCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T T GGGGCCAAAAGT CCAGT GAAAT T GTAAGCT T CAATAAAAGGAT GAAACTC TGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 

SGYI SMKELKQALVNCNWSS FNDETCLMMINMFDKTKS GRI DVYGFSALWKFIQQWKNL FQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 

DTAVQGNIRLSFEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-63-, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 

T CC CAGGCC TCT T T GC CCGGAGCAT CGGTGT T GT GGAGGAGAAAGT TTC CCAAAACT TCGGGACC 

AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 

TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 

GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 

GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 

AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 

TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 

TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 

CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 

TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

ATCTGGGGTATCAATAAT CA&CCCC CAGGTACCAGCT GGGGAAATAT TAAT CGG TAT CCAGGAGG 

CAGCTGGGGAAATAT TAATCGGTAT CCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 

CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 

TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

TGCTCATCTCCAATAA^TAAAAGCACTTATGAAAA&A^^ 

AAAAAAAAAAAAAAAAAAA^ 



FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 



N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 




Important features of the protein: 
Signal peptide: 



Casein kinase II phosphorylation sites. 



amino acids 1-26 



amino acids 56-59, 155-158 



1 y 
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FIGURE 6* 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAM.TTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 



III 

m 




CTGAGTGGCAATAAATAAAATTCGGTATGCTG 



o 

rii 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 




N-glycosylation site. 



amino acids 37-41 



N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCGTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTXGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCT CTGGGCAGTCAGAT CCACC CAGT GCT T AATAGCAGGGAAGAAGGTAC TT CAA 
AGACTCTGCCCCTGAGGTCAAGAGAGGAT GGGGCTAT T CACTT T TAT ATAT TTATATAAAAT TAG 
TAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 2 6-32 
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FIGURE 6q 



GCCAGGAAT AACTAGAGAGGAACAATGGGGTTATTCAGAGGTTT T GT TTT CCTCTTAGTTCT GT GCCT GCT GCACCAG 
T CAAATACTTCCTTCATTAAGCT GAATAATAAT GGCTTT GAAGATATT GT C ATTGTTATAGATCCTAGTGTGCCAGAA 
GATGAAAAAATAATT GAACAAAT AGAGGATATGGT GACTACAGCT TCTACGTACCT GTTT GAAGCCACAGAAAAAAGA 
TT TTTTTT CAAAAAT GTAT CTAT ATT AATT CCT GAGAATTGGAAGGAAAAT CCT CAGTACAAAAGGCCAAAACAT GAA 
AACCATAAACATGCTGATGTTATAGTTGCACCACCTACACTCCCAGGTAGAGATGAAC^ 

GAATGTGGAGAGAAAGGCGAAT ACATTCACTT C ACC CCTGACCT TCT ACT T GGAAAAAAACAAAATGAATAT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTC^ 

TACCGTGCTAAGTCAAAAAAAAT CGAAGCAACAAGGTGTT C C GCAGGT AT CTCT GGT AGAAATAGAGTTTATAAGT GT 
CAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACTGTATGGAAAAGATTGTCAATTCTTT 
CCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTAT GCAAAGTATTGATTCT GTTGT T GAATTT TGT AACGAA 
{sfc AAAAC C CAT AAT CAAGAAGCTCCAAGC CT ACAAAACATAAAGTGCAAT TTTAGAAGT ACAT GGGAGGTGATTAGCAAT 

tn TCTGAGGATTTTAAAAACACCATACCCAT GGTGACACCACCTCCTCCACCT GTCTTCTCATT GCT GAAGATCAGTCAA 

Nf AGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCTAAATCGAATGAATCAAGCAGCA 
lij] AAACATTTCCTGCTGCAGACTGTTGAAAAT GGAT CCTGGGT GGGGAT GGT T CACTTT GATAGTACTGCCACTATT GTA 

fft AATAAGCTAATCCAAATAAAAAGCAGTGAT GAAAGAAACACACTCATGGCAGGATTACCTACATATCCT CTGGGAGGA 

' !? | ACTTCCAT CTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCC CAACT CGAT GGATC CGAAGTA 

tlf CT GCT GCT GACT GATGGGGAGGATAACACT GCAAGT T CTT GTATTGAT GAAGT GAAACAAAGTGGGGCCATT GTT CAT 

H I TT TAT T GCTTT GGGAAGAGCTGCTGAT GAAGCAGTAATAGAGAT GAGCAAGATAACAGGAGGAAGTCAT TTTTATGTT 

TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
TC CCT TCAGCTCGAAAGTAAGGGATT AACACT GAATAGTAATGCCTGGAT GAAC GACACTGT CATAATTGATAGTACA 
GTGGGAAAGGACACGTTCTTTCT CAT CACAT GGAACAGT CT GCCTCCC AGT ATT T CT CTCTGGGAT CCCAGT GGAACA 
AT AAT GGAAAATTT CACAGT GGATGCAACTTCCAAAATGGC CT ATCT CAGT ATT CCAGGAACTGCAAAGGT GGGCACT 
* TGGGCATACAATCTTCAAGCCAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCAGCAAATTCTTCT 
p| GTGCCTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATTGTTTACGCAGAA 
ATTCT ACAAGGATAT GTACCTGTTCTTGGAGCCAATGTGACTGC TTTCATTGAAT CACAGAATGGACATACAGAAGTT 
III TT GGAACTTTT GGAT AATGGTGCAGGCGCT GATT CT TTCAAGAATGAT GGAGTCTACT CCAGGTATTTTACAGCAT AT 

PI AC^GAAAATGGCAGATATAGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCA 
if. CTGAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCCAAGACCTGAAATTGAT 
FfJ GAGGATACTC^GACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGAGGTGCATTTGTGGTATCACAAGTCCCAAGC 
f*f. CTTCC CTT GCCT GACCAATACCCACCAAGT CAAATCACAGACCT TGAT GCCACAGTT CATGAGGATAAGATTATT CTT 

ACOTGGACAGCACCAGGAGATAATTTTGATGTTGGAAAAGTT 

GATCTAAGAGACAGTT TTGATGATGCTCTTCAAGTAAATACTACTGAT CT GTCACCAAAG GAGG CCAACT CCAAGGAA 
AGCTT TGCATTTAAACCAGAAAA.TAT CTCAGAAGAAAATGCAACCCACAT ATTT ATT GCCATTAAAAGTAT AGAT AAA 
AGCAATTTGACATCAAAAGTAT C CAACATT GCACAAGTAACTTTGTTTAT C CCT CAAGCAAATCCTGATGACATTGAT 
CCTACAC CTACT CCT ACTCCTACTCCTACT CCT GAT AAAAGTCATAAT TCT GGAGTT AATATT T CTACGCT GGTAT T G 
TCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCAT TTGAA CCTTAACGAAGAAAAAAATCTTC 
AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGTAAAGGATATTTCTGAATCTTAAAATTCATCCCAT 
GTGTGATCATAAACTCATAAAAATAATTTTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATA 
T GTAAAAACTGT CAAGATTAAAATTT AATAGTTTCATT TATTT GTTAT TT T AT TT GT AAGAAATAGTGATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACATTTGGA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKOTSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRI DS TTKL YGKDCQFFPDKVQTEKAS IMFMQS IDS WE FCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKH FLLQTVENGS WVGMVHFDS TAT ITOKL I QI KS SDERNTLMAGLPT YPLGGTS ICSGIK 
p YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
p SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
01 ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANS S VPP I TVNAKMNKDVNS FP S PM I VYAE I LQGYVPVLGANVTAF I E SQNGHTEVLELLDNGA 
GADS FKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYI PGWVVNGE IEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRI SASI LDLRDSFDDALQVNTTDLSPKEANSKES FAFKPENISEENATHI FIAI 



13 KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 

Srf VNFILSTTI 

III 



Signal peptide: 



||| amino acids 1-21 



Putative transmembrane domains: 

amino acids 284-300, 617-633 



Leucine zipper pattern. 

amino acids 469-491, 476-498 



N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCT TAGGT GGAAACCCT GGGAGTAGAGT ACT GAC1AGCAAAGACCG GGAAAGACCAT ACGT CCCCGGGCAGGGGT GA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAGCCAC CTCCT AACCTTCC CTT CTT GAACCCCCAGTTAT GCCAGGATT T ACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCT CAGAAGT GACTT GCCTGAGGGT GGACCAGAAGAAAGGAAAGGT CC CCTCT TGCTGT T GGCT GCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CT GGCACT GCCC AGGGCCAACAGCCC CAC GGGGAAGGAGGGGT ACCAGGC C GTC CTT CAGGAGTGGGAGGAGCAGCAC 
CGCAACTACG TGAGCAGCCTGAAGCGGCAGATC GCACAGCT CAAG GAGGAGCT GCAGGAGAGGAGTGAGCAGCTCAGG 
AATGGGCAGTACCAAGCC^GCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCI^GAGAAAACCCAGGCCGACCTC 
|'«| CTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGC^ 

*J[ CCTTT CGATAGCT TT ACTCTACAGAAGGT GTACCAGCTGGAGACTGGCCTTACC CGCCACC CCGAGGAGAAGCCT GTG 

M S AGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGC 
III C C CAATCACC GT CCTT AC^CGGCCTCT GATTTCATAGAAGGGATCTACCGAACAGAAAGGGACIAAA.GGGACAT TGT AT 

GAGCT CAC CT TCAAAGGGGACCACAAACAC GAAT TCAAAC GGCT CAT CTT ATTT CGAC CAT T CAGCC CCAT CAT GAAA 
GTGAAAAATGAAAAGCTC^CATGGCCAACACG^TTATC^ 

CGGCAGTT CATGCAGAATTTCAGGGAGAT GTGCATTGAGCAGGATGGGAGAGTCCAT CT CACTGTT GTTTACTTT GGG 
AAAGAAGAAATAAATGAAGTCAAAGGAATACTT GAAAACACTTCCAAAGCT GCCAACT TCAGGAACTTTACCTT CATC 
C^GCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTTCTGT GAT GT GGACAT CT ACTT CACATCT GAATT CCT CAATACGTGT AGGCTGAATACACAGC C AGGGAAGAAG 
GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
CAGCAGCT GGTCATAAAGAAG GAAACTGGATTT T GGAGAGACTTT GGATTT GGGATGACGT GTCAGTAT CGGTCAGAC 
fl| TTCAT CAATATAGGT GGGTTT GATCT GGACATCAAAGGCTGGGGCGGAGAGGAT GTGCACCT TTAT CGCAAGTAT CTC 

p'i- CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAG 
W CTGAC CCC C GAGCAGTACAAGATGTGCAT GCAGT CCAAGGC CAT GAACGAGGCAT CC CACGGCCAGCTGGGCATGCTG 

jfU GTGTT CAGG CACGAGATAGAGGCTCACCT T CGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACTCCCAGA 

p» GAAGGATT GT GGGAGACACTTT T TCTTTC CTTTT GCAATTACT GAAAGT GGCT GCAACAGAGAAAAGACTT CCATAAA 

M GGACGACAAAAGAAT TGGACT GATG GGTCAGAGATGAGAAAGCCT CCGAT TTCT CTCT GTT GGGCTTTTTACAACAGA 

fil AATCAAAAT CTC CGCTT TGCCT GCAAAAGT AACCCAGTTGCACCCTGTGAAGTGT CT GACAAAGGCAGAAT GCTT GTG 

AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AATATT CAT GAT TTAAGAGCAGT TTT GTAAAAAATT CATTAGCATGAAAGGCAAGCATATTT CT C CTCATAT GAAT GA 
GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTATTATGATACT 
AGTGAG TACATT AAGTAAAA.TAAAAT GGACCAGAAAA.GAAAAGAAACCATAAATATC GTGT CATATTTT CC CCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGT GAGTTATAGTCT GCTTATTT AATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGT T GGCCT AC 
ATTTT TATATTTT TT AAGAAGAT ACTTTGAGAT GCATTAT GAGAACTTTCAGTT CAAAGCAT CAAATT GAT GCCATAT 
CCAAGGACATGCCAAATGCTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACT 
AATACAGACGTACAGATACTTT CTCT GAAGAGTATTT T CGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAAT GAC 
ACTTT CTGCTTTACAGAAAAGGAAACT CATTCAGACT GGT GAT ATCGT GAT GTACCTAAAAGTCAGAAACCACAT TTT 
CT CCT CAGAAGT AG GGACCGCTT TCTTACCTGTT TAAATAAACCAAAGTATAC CGTGT GAAC CAAACAATCT CTTTTC 
AAAACAGGGTGCTCCTCCT GGCTTCT GGCT TCC ATAAGAAGAAAT GGAGAAAAAT ATATAT ATAT ATATAT ATAT TGT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACT GAATTATT TT TTAAATT AAGCAGT T CTACT CAAT CACCAAGATGCTTCT GAAAATT GCATTTTATTACCATTT 
CAAACTAT TTTT TAAAAAT AAAT ACAGTT AACATAGAGTGGTTT CTT CATT CAT GTGAAAATTATTAGCCAGCACCAG 
AT GCAT GAGCTAATTATCT CTTTGAGTCCT T GCTT CT GTTT G CT CACAGTAAACT CATTGTTTAAAAGCTT CAAGAAC 
ATTCAAGCT GTT GGT GTGTTAAAAAAT GCAT TGTATT GATTT GT ACT GGT AGTTTATGAAATTTAATTAAAACACAGG 
CCATGAAT GGAAGGT GGTATT GCACAGCT AATAAAATATGAT T T GTGGATATGAA 
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FIGURE 72 



MMMTORGLl^WISRVWLLVLLCCAISVLYMl^CTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEWAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKR^ 

TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
: n NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
f*l KYLHSNLIVWTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLQ^LVFRHEIEAHL 
ill RKQKQKTSSKKT 

k 



m 

h 
III 



Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 



it! 
W 



GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
p! ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATTTGCAGAGCTCCACAT TGTACATTATGACTCTGATTCCTATGACAGCTTGAGT G 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
« AATATAGCT TATGAACACATTCTGAGT CACTTGCAT GAAGTCAGGCATAAAGATCAGAAGACCTC 

p AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCT^^ 

III GCTCGCTCACAACTCCCCCTTGCrACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
*!! AT T TCAATGGAACAG C T GGAAAAGC T T C AGGGGACAT T G T T C T CC AC AG AAGAGGAGC CC T C T AA 

fit 

GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 

M' 

fll TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTGTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAG 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTGATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGC TGTGTTGT TAAT GCAGAGANCAAACTCTGT TTAGT T GCAGGGGAAGT T T GGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74 



MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGIAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLIAVYFIARKIRKKRLENRKS 
WFTSAQAT TEA 



CI 

p : f Important features of the protein: 

Ill Signal peptide: 



amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

PI N-glycosylation site. 

r* 

b! amino acids 213-216 

n's 
fw 

M 

f|| Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7S 
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TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGaATGTCCTACA 
ATGGACTCCACCAGAGGGTCTTCAAGGAGTTAAAGTTACTTACACTGTGCAGTATTTCATCACAA 
AT T GG CC CACC AGAGG T GGC AC T G AC T ACAGAT GAG AAG TC CAT T TCTG TTG T C C TGACAGC TC C 
AGAGAAGTGGAAGAGAAATCCAGAAGACC TTCC TG TT TCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGC CCCCTCGCCGTGCT C AGCC T T C T GAGAAGC AGT GT GCC AGGAC T T T GAAAGAT CAATC AT 
||1 CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
III TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
GAT T T T GAT T TAT GGAAATGAAT T T GACAAAAGAT TC TT TGTGCC TGC T GAAAAAATCGTGAT TA 
ACT T TAT CACCCT CAATAT CTCGGATGAT TCT AAAAT T T CT CATCAGGATATGAGTT TACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
AAAACAC GGAAGGTAC T TC TC TCACC CAGCAAGAG TC CCTCAGCAGAACAATACCCCCGGAT AAA 
ACAGTCAT T GAATATGAATAT GAT GT CAGAACCAC TGACATTT GTGCGGGGCCT GAAGAGCAGGA 
GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 
TCTTGGGCCCGCAAACGT TACAGTAC T CATACACCCCTCAGCT CCAAGACT TAGACCCCCT GGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCT.TCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA 
TG T GCAGATGGAAAACTGAT GCCAACACT TCC T T T TGCC TT TTGTTTCCT GTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKZ^IIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPi^LILIYGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQEMEEWGLYVQMEN 



0 

01 Important features: 

Signal peptide: 



amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAAXGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGT TCCACATGACGACTGAGGCCCAAGCCAC CATC CGCATGGACACCAGT GCAAGT GGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
C CAAT CT AGTGAAAAACCAGCT G T G T CCCGTGATCGAGGCT TC CT TCAATGGCATGTAT GCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T TTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACAT CAGCTCTGATCGGATCCAGCTG 
ATGAACT CT GGGAT T GGC T GG TT CCAACC TGATGT TC TGAAAAACATCATCACTGAGAT CATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKyiKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGG I PVLGSLVNTVLKHI IWLKVI TANT LQLQVKPSANDQELLVKI PLDMVAGFNT PLVKT I VE 
FHMTTEAQAT IRMDT SASGPTRLVLSDCATSHGSLRI QLLYKLS FLVNALAKQVMNLLVPSLPNL 
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT 
KW FNNSAAS LTMPTLDN I P FS L I VS QDWKAAVAAVLS PEE FMVLLDS VL PE SAHRLKS S IGL IN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNNI S SDRI QLMNSGI GWFQPDVLKNI I TE I IHS I LLPNQNGKLRS GVPVS LVKALG 
FEAAESSLTKDALVLTPASLWKPSS PVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGT CAGCCTGGCAGAGAGACTCTGAAATGAGGGAT TAGAGGTGTT CAAGGAGCAAGAGC 
T TCAGCC TGAAGACAAGGGAGCAGTCCCT GAAGACGCT T CTAC T GAGAGGTC TGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
p GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
f|! ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 

III CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 

| is. 

M| CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
4i CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
J'' GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCT GT GAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGGAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 



||| 

fgj 

ll 

Hi GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAAT T TGTGCATGAACT GAAATAAAACCAT CC TACGGT AT CCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 



AppJD= 
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FIGURE 80 

MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I Y S TLLGLPAD I QAAQAMMVT SS AI S S LAC 1 1 S WGMRCT VFCQESRAKDRVAVAGGVFFI LGGL 
LGFI PVAWNLHGILRDFYSPLVPDSMKFE IGEALYLGI I SSLFSLIAGI ILCFSCSSQRNRSNYY 
DAYQAQPLATRS S PRPGQPPKVKSE FNS Y S L TGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 8l 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 

CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 

GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 

GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 

ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 

CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 

CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 

ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 

GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 

TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 

AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 

AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 

CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 

GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 

CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCG 

CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 

ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 

CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 

C TCGGACT T GGTGGAGATCTACCT CT GGAAGCTGGTAAAAGAT GAGGAAACTGAGGCT CAGAGAG 

GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCT TCCACT TGACTCAGATCAAGAAAGTCAG 

GAAGC AAGAC T TCC AGAAAGAGGCAC AGCACT T CC GAC T GCTCGCT GGCCCC CACGAAGGTCACT 

GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 

ACATAACAAAGACCTAACAGT TGCAGATATGAGCTGTATAATT GTTGT TATTATATATTAATAAA 

TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 82 



a 



1WPEVRVLSSLLGLALLWFPLDSHAMRPDMFCLFHGKRYSPGESWHPYLEPQGW 

GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 

PSRLPNQCVLC5CTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 

HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 

HGEWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 

ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 

SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 



Signal peptide: 

M : * amino acids 1-25 

HI 

ill 
III 

Si 

M 

m 
I 

III 

FU 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGTTCTCT AGCGTGGTCC AC CTCT ACAGGGACGGGAAGGACC AGCC ATTT AT GC AGAT GC CAC AGT ATC 
| : A AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
Cl GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
E ? t AC AGGTGTCAGCAC TGGGCTCAGT TCCTCTCATTTCCATC ACGGGATATGTT GAT AGAGACATCC AGCTAC 



m 



TCTGTCAGTCCT CGGGCTGGTT CCCCCGGCCCACAGCGAAGT GGAAAGGTCCACAAGGACAGGATTTGTCC 
ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
CGCC GGGAGC AT AT CCTGTT CC AT GCGGCATGCTC ATCTG AGCCGAGAGGT GGAAT CC AGGGT AC AGAT AG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCTCACC CGAAGCTCTGCGT TTCTG AT CT GAAAACTGT AACCC AT AGAAAAGCTCCC CAGG AGGTG 
CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 
II CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
3 AGGAGT ACGT GACT TTGTCT CCCGATC ATGGGTACTGGGTCCTC AGACTGAATGGAGAAC AT TT GT ATTTC 

ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCT TATTGAGGCCCT AC AT T GAGT ATCCGT CCT AT AAT GAGCAAAATGGAACT CCC ATAGT C ATCTGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATG TAGG ATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GAT GAAGGGGGACT GGCCT GTCCAC AT G GGAGTC AGGT GTCAT GGCTGCCCT GAGC TGGG AGGGAAGAAGG 
CTGAC ATT ACAT TT AGTTT GCT CT C ACT CC AT CT GGCT AAGT GATC TTGAAAT ACC ACCT CT CAGGTG AAG 
AACCGT C AGGAATT CCC AT C TC AC AGGCTGTGGT GT AG ATT AAGT AGAC AAGGAATGTGAAT AAT GCT T AG 
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 



HI 
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FIGURE 8a 



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKZVIWELQ 
VSALGS VPL I S I TGYVDRDI QLLCQSS GWFPRPTAKWKGPQGQDLS TDSRTNRDMHGLFDVE I SL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHA.VEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 




LNPRFI S VFPRTPPTKI GVFLDYE CGT IS FFN INDQS LI YTLT CRFEGLLRP Y I EYPS YNEQNGT 



PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 



Signal peptide: 



amino acids 1-17 



Transmembrane domain: 



amino acids 239-255 



Hi 



Hi 



App_ID=10063538 



Page 236 of 320 



FIGURE 85 



AACAGACGT TCC C TCGCGGCCCTGGCACCTC TAACCC CAGACATGC TGCTGCTGCT GCT GCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AG T GGCCACAAACAACCCAGCT CGGGCAGTG TGGGAGGAGACT CGGGACCGATTCCACC TCC TTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATC TCAACGT GT CC TACCCGCC TCAGAAC TT GACCATGAC TGT CT TCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCT GT CAGGGGT TC 
AGCCTCTCAGGGGCCCCTGACT GAACCT TGGGCAGAAGACAGT CCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACT GCAGAGACTCACCCTGAT TGAGGGATCACAGCC CC TCCAGGCAAGGGAGAAGTCA 
GAGGCTGAT TCT T GTAGAATTAACAGCCC TCAACGTGAT GAGCTATGATAACAC TATGAATTATG 
TGCAGAGTGAAAAGCACACAGGCT T TAGAGTCAAAGTAT C TCAAACCTGAATCCACACTGTGCCC 
TCCCTT T TAT T TT TT TAAC T AAAAGACAGACAAAT TCCT A 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHGYWFREG 
i^TDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSWVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVT FPGASVTTNKTVHLNVSYPPQNLTMTVFQGDG TVS TVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPPA^HLRDAAEFTCRAQNP 
LGS QQVYLNVS LQSKATSGVTQG WGGAGATALVFLS FCVI FWVRS CRKKS ARPAAGVGDTG I E 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCAC TC TG T TGAGCTC CAGGGCGCAGT GGAGGGAGGGAGT GAAGGAGCTCTC T GTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCT T CCTGCTGTT TC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCTT TGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCT GGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCT CCCTGCTGAGG TACCGCAC GGACAC T GGCT TCCTCCAGACACTGGGACATAATCT 
GTTTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGACTGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGC GGGAATTCACT GCGGGAT TTGT TCAG T T CAGGGT AT TTAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 

GGAACTCAT GT TGG T TACAGCAGCAGCCG TGAGATAACT GAGGCAGCTGTGCT T CTAT TCTATCG 
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
CAACT TACCCAGTAGC TAGAAT GTTAATGGCAGAAGAGAAAACAATAAAT CATAT TGACTCAAGA 



AAAAAA 



FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQT FCDMTSGGGGWTLVASVHENDMRGKCTVGDRWS S QQGSKADYPEGDGNWANYNT FGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWT DNGPV I PWYDFGDAQKT AS Y YS P YGQRE FTAG FVQ FRVFNNE RAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 




N-glycosylation site. 



Signal peptide: 



amino acids 1-16 



amino acids 163-167 



Glycosaminoglycan attachment sites. 




N~myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 



amino acids 74-78, 289-293 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAG7GGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
^ ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
f?t TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
H! TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 

W 

HI 



III 



I? I 



GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGACT T ATATACAGT T TTAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE QQ 

MGRVSGLVPSRFLTLLAHLVWI TLFWSRDSNI QACLPLT FTPEE YDKQDI QLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMMj^ 

TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Ql 



III 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTGACCTAAGTCCCGCCCCATGGCCAATTCCAGTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAAC AC AATGCCACAGGCCAGGGAAACATC T CCAT C AGCC TCGTGCCCCCCAGTAAAGCT GTAG 
f|| AGT TCCACCAGGAACAGCAGATCT TCATCGAAGCCAAGGCCT CCAAAATCT TCAACTGCCGGATG 

||! GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
T C T AC AT CGCC T TC T ACAGC AC GGACT AT CGGC T G G T CCAGAAGGT GT GCCC AGAT TACAAC T AC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
p TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
||1 AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
P AGAAAGGGT CCCAAGTGCTGGT CCCAACC T GAAGC T GTGGAG T GACTAGAT CACAGGAGCACTGG 

jj: AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
«? TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 

CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 



MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
S VHFQHNAT GQGN ISIS LVPPSKAVE FHQEQQ I F I EAKAS KI FNCRMEWEKVERGRRTS LCTHDP 
AKI CSRDHAQS SAT WSCS QPFKVVCVY I AFYSTDYRLVQKVCPDYNYHS DTP YYPSG 



Important features of* the protein: 
h k Signal peptide: 

amino acids 1-14 



N-glycosylation sites. 

HI amino acids 62-65, 127-130, 137-140, 143-146 



2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTTCCGAT TTGCATAT TATAAACTCT TAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACTTCCCAAACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCT 
T T T T CTGAAAATCCCT TT T TCT GGT GGAAT TGAGAAAGAAATAAAACTATGCAGAT A 
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FIGURE Q4 

MTAAVF FGCAF I AFGPALAL YVFT I AI E PLRI I FLI AGAFFWLVS LL I S S LVW FMARV 1 1 DNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSABHTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site* 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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01 

hi 



AATT T T TCACCAGAGTAAACT T GAGAAACCAACTGGACCT T GAG TAT T GTACATT T TGCCTCG TG 
GACCCAAAGGTAGCAATC TGAAAC ATGA GGAGTACGAT TC TACTGT T T TG TC T T CT AGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTGATACCATTAACA 
CAGATGCT CACACTGGGGCCAGAT CT GCATCTGTTAAATCC TGCTGCAGGAATGACACC TGGTAC 
CCAGACCCACCCATTGACCC TGGGAGGG TTGAATG TACAACAGCAAC TGCACCCACAT G T G T TAC 
CAATTT T TGTCACACAACT TGGAGCC CAGGGCAC TAT CC TAAGCT CAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACT CCCAGTGGCACAGATGACGACTT TGCAGTGACCACCCCT 
GCAG GCAT C GAAAG GAGC AC AC AT GC C AT C G AGGAAG CC AC CACAGAAT C AG CAAAT G GAAT T CA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ATTGAT T ATATTAT GGAATAGAT T GAGACACAT T GGATAGT CT TAGAAGAAAT TAAT T CT TAATT 
TACCTGAAAATATTCTTGAAATTTCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTrAAAAA 
CAATAATTCAATGGATAAAT CT GT CT TTGAAATATAACAT TATGCTGCC TGGAT GATATGCATAT 
TAA^CATATTTGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAA&AAAAAA 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
T DD D FAVT T PAG I QR S T HAI EE AT T E S ANG I Q 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
f* h GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
p CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
jp ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
gft CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
C[ I CCCTAGAGCCTATC CACC TGGCAGAGAATCTCAAAG T CCTATACCCGCACCACATGGCCCAGGTA 

W CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
111 TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
||! AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
m ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
m CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
J,., ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
IJ CAGG TGGACCCCAC TTCAGGCAG TG T GACGCTGGGGGTGC TCC CAC T C CGAGCAGGCCAGAACAT 

||| CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
jp't T CGAAGT CGCAG TCACAGATATCAAT GAT CACGCCCCTGAGT T CAT CACTTCCCAGAT TGGGCC T 

E ATAAGCCTCCCTGAGGATGTGGAGCCGGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
C CT CGAGCCCGCCT TCCGCCT CATGGAT T T TGCCAT T GAGAGGGGAGACACAGAAGGGACT T TTG 
p GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
f|| GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTT TCC TGCT GAG CATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCT CT GCACACCCCGCCAAGACCAT GGC T TGATCGTGAGTGGACCCAGCAAGGACC CCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATT TTCACCCAC TGGACCATG TCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTC TGAA TGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
T CCCC TGGGAGAGAGCCCAGCACC CAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE Q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQi\EYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S T S LDHALERT YQL LVQVKDMGDQAS GHQAT AT VE VS 1 1 E S T WVS LE P I HLAENLKVL YPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVU^MDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRIiMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVWWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHI IPVWSHNAQ1WQLLTOVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGI FLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 

GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGAT T C T TCCCAGGAACACAAACGTAGGAGACC CACGCTCCTGGAAGCAC CAGC CT T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGAC CCAGGCAT C TT GCT T TCCAGCGACAAAGAGACAGATGAAGATGCAGAAAGGAAAT G 
TTCTCC TTAT GTT TGGT C TACTAT T GCAT TTAGAAGCT GCAACAAAT T CCAAT GAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 

GCATAGT CACCAAC TCTGAGT T CCAT ACAACCTCCAGTGGGAT CAGCACAGCCACCAACT CTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
H GGCCAGCACAGCCACCAACT CT GAG T CCAGCACACCCTCCAG T GGGGCCAGCACAGTCACCAACT 

P CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
p AGT AGGGCCAGCAC TGCCAC CAACTC TGAGTCTAGCACAC T C T CCAGTGGGGCC AGCACAGCCAC 

fil CAACTCT GACT CCAGCACAACCTCCAGT GGGGC TAGCACAGCCACCAACT CTGAGTCCAGCACAA 

CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
W GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAAGTCTGAGTCCAG 
111 AACGACCTCCAATGGGGCTGGCACAGCCACCAAGTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
hi GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
m TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
* U GGCTAGCACAGCCACCAACTC T GAG T CCAGCACAACCTCCAGT GGGGCCGGCACAGCCACCAACT 

*. CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
Cl AGT GGGGCC AACACAGCCACCAACT C T GAGTCCAGTACGACCT CCAGTGGGGCCAACACAGCCAC 

|!1 CAACT CT GAGT CCAGCACAGTGTCCAG TGGGGC CAGC ACT GCCACCAACT CT GAGTCCAGCACAA 

K CCT CCAGT GGGGTCAGCACAGCCACCAAC T C T GAGT C CAGCACAACCT CCAGTGGGGCTAGCACA 

St GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG 
*« CACAGTGTCCAG TGGGAT CAGCACAGTCACCAAT TCT GAG T CCAGCACAACCTCCAGT GGGGC CA 

L3 ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
111 ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 

GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CAT GGCCTCAACCATGGCCT TGGT CCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGT TCTGGAGGAGACCAGTATCATCGATAGCCATGGAGAT GAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 
TGCTCAT T T AGC TAAGAAATAAATACATC T CATCTAACACACACGACAAAGAGAAGCT GT GC T TG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT CTGGCAT T CAAAATCTCCACAGT AAAAT CCAAAGAC C T CAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GAST VTNSGS S VT S S GAS TATNSE S STVS SRAS TATNS ES ST L S S GAS TATNS DS ST T S S GAS TA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWE I FLITLVSWA 
AVGLFAGLFFCVRNSLSLRMTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 

AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 



App_ID=l 0063 538 



Page 252 



FIGURE 1Q1 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGAT GT GATATATGCAT TCCAGGGGAAGGGAAAT TGTGGT GCT TCTGAACCCATGGTCAATT 
AACGAGGCAGT T TC TAGC TACT GCACGTACTTCATAAAGCAGGACTCTAAAAGCT T TGGAATCAT 
GGT GT CAT GGAAAGGGAT T T ACTT TATAC TGAC TC T GT T T T GGGGAAGC T T T T T TGGAAGCATT T 
H* T CATGCT GAG TCCC T T T T TACCT T TGATGT T TG TAAACCCATCTTGGTATCGCT GGAT CAACAAC 

*jj CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
|| GAT TAT AACTGGGGATGCATT T GTT CCT GGAGAAAGAAGT GTCATTATCATGAACCAT CGGACAA 

Li GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
111 TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
W CTTCATTCATAGGAAATGGAAGGATGACAAGAGCCATTTCGAAGACATGATTGATTACTTTTGTG 
ATATTCACGAACCACTTCAACT CCTCATAT TCCCAGAAGGGACTGATCTCACAGAAAACAGCAAG 
In TCT CGAAG T AATGCAT TTGCTGAAAAAAAT GGACT T CAGAAAT AT GAAT ATG TT T TACATCCAAG 

III AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
0 ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
f'P CCCAGGGAAATCCAC T T TCACGTCCACCGGTATCCAATAGACACCCT CCCCACAT CCAAGGAGGA 

pj; CCT TCAACT CT GGT GCCACAAACGG T GGGAAGAGAAAGAAGAGAGGC TGCGT TCCT TC T ATCAAG 

GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
AT AT T T G T ACAG T C T T G T T AAG T GG TAT T T T ATAAT C AC CAT T G T AAT CTTTGTGCTG C AAGAGA 
GAATAT T T GGTGGACT GGAGAT CATAGAACT TGCAT GT TACCGACTT T TACACAAACAGCCACAT 
TTAAATTCAAAGAAAAATGAGTAAGAT TATAAGGTTT GCCATGTGAAAACCTAGAGCATATT TTG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCC AT CAT T AT TT GT TAAAGAT AT TT TGCAC TTAAT T T TGT GGGAAAAATAT TGCT ACAAT T T T T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 

GGGCCAGAAT AT TATTAAACAATCAT CAGGC TT T T AAA 
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FIGURE 102 



MHSRGRE I WLLNPWS INEAVSSYCTYFIKQDSKS FGIMVSWKGI YFILTLFWGS FFGS I FMLS P 
FLPLMFWPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERS VI IMNHRTRMDPflM 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLI FPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTT GFT FWDRLREGKNLDAVHD I TVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQS VI PPCKS ELRVL WKLLS I LYWTLFS PAMCLLI YLYSLVKWYFI I T I VI FVLQERI FGG 




LEI IELACYRLLHKQPHLNSKKNE 



Important features of the protein: 
Signal peptide: 



amino acids 1-22 



Transmembrane domains: 



amino acids 44-63, 90-108, 354-377 
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FIGURE 1Q3 

CGGCT CGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCT GCGCC TGAGACAGCT GGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCT C T ACAGAGATGGGGAAGAC TGGGAATC TAAGCAGAT GC CACAGTATCGAGGGAGAAC T GA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTAT GGGTGCT GGT T CAGT TCCCAGAT T T ACGAT GAGGAGGCCACCT GGGAGCT G 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
h« GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
gS AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
m ATT AT AGTCCAGGAAAATGC TGGGAGCATAT T GTG T T CCAT CCACC T T GC TGAGCAGAGT CATGA 

!;*! GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
Ms TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
III AAATCCAAAGGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGCAGAATTGAG 
II! AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
iil T TT CT GATCT GAAAAC TGTAAC CCAT AGAAAAGCTCCCC AGGAGGTGCCTCAC T C T GAGAAGAGA 

TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGT GGGACAAAATGTAGGG T GGTAT GTGGGAGT GTG T CGGGAT GACGTAGACAGGGGGAAGAACA 
F ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
£jl ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
lit GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 

TGACATGTCAGT TTGAAGGCTT GTTGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAG 
GGGAC TCCCAT ATT CAT ATGTC CAGT G T CC TGGGGATGAGACAGAGAAGACCCT GCTTAAAGGGC 
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
P CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
h TGAGC CCTGCAGCAGC GGCAGT CACAGCT TCCAGAT GAGGGGGGATTGGCCTGACC CT GT GGGAG 

TCAGAAGCCAT GGCT GCCCT GAAGT GGGGACGGAATAGACT CACATTAGGT T TAGT T T G T GAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAG T GAT T CCT GCCT CACAGGTGAAGAT TAAAGAGACAAC GAAT GTGAAT CATGC T T GCAGG TT 
TGAGGGCACAGTGTTTGCTAAT GATGTGTTT TTATAT TATACAT T T TCCCACCATAAACTCTGTT 
TGCT TATTCCACATTAAT T TACT T T T C TC TATACCAAAT CACCCATGGAATAGT TATT GAACACC 
TGCT T TGT GAGGCTCAAAGAATAAAGAGGAGGTAGGAT T T T T CACTGATTCTAT AAGCCCAGCAT 
TACCT GATACCAAAAC CAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATAT CC 
CTCATTAACACAGACACAAAAATTCTAAATAAAATTTTAACAAATTAAACTAAACAATATATTTA 
AAGAT GATATATAAC T ACTC AGT GTGGT TT G TCCCACAAAT GCAGAGTT GGT T TAATAT T T AAAT 
ATCAACCAGTGTAAT T CAGCACAT TAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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FIGURE 1Q4 



MAFVLI LVLS FYELVSGQWQVTGPGKF^QALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAVVH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPL I S I VGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVE I S I 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTL S PNNGYWLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 

TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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m 



CCTT C ACAGGACTCTTCAT T GC T GGTTGGC AATGAT GT ATCGGCC AGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACTGTTC AT TATGTGAGAT AT AATCAAAAGAAG ACCT AC AAT T ACT AT AGCAC AT TGTCAT TT AC AAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTC AAC AGAAGCAT GGAGT GTT GGCTC AT AT GCT GTTGATTT GT AGATTTC ACTCTACTGAGGATCCTGA 
AACT GT AG AT AAAAT TGT TC AACT TGTT TT ACAT GAAAAGCT GC AAGATGCT GT AGGACCCC CT AAAGTAG 
ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
AC ACGAAG AAGT AAAACTCT AGGT CAGAGTCT CAGGATCGT T GGTGGG AC AGAAGT AG AAGAGGGTGAAT G 
GCCC TGGC AGGCTAGCCTGC AGT GGG ATGGGAGT C ATCGCT GTGGAGCAACCT T AATT AAT GCC AC ATGGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
\fl AT AAAACCTTCGAAAATG AAACGGGGTCT CCGGAGAAT AATT GT CC AT GAAAAAT AC AAACACCCAT C ACA 

|$| TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
||1 TCCCTGATGCAT CCT ATGAGTTTCAACCAGGT GATGTG ATGT TT GTGACAGGAT TTGGAGCACT GAAAAAT 

S GATGGTTACAGT CAAAATCAT CTT CGAC AAGC ACAGGT GACT CT C AT AGACGCT AC AACTT GCAATGAACC 

W TC AAGCTT AC AATGACGCCAT AAC TCCT AG AATGTT ATGTGCTGGCTCCT TAGAAGGAAAAACAGATGCAT 

GCCAGGGTGACT CTGG AGGACC AC TGGTT AGT T CAG AT GCT AGAGAT ATCTGGTACCTTGCT GGAATAGTG 
AGCT GGGGAGAT GAAT GTGC GAAACC C AACAAGCCT GGTGT T TAT ACT AG AGT T ACGGCCTT GCGGGACT G 
GATT AC TTCAAAAACT GGTATC TAA GAG ACAAAAGC CT CATGGAAC AGAT AACATT TTTT TT T GT TT T TTG 
GGT GT GGAGGCC AT T TTT AGAGAT AC AG AATT GGAG AAGACT TGC AAAACAGCT AG ATTTGACT GATC TC A 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATCAACTCTGTCATCTGTGAGCAATAGTTGAAACTTTATGTACATAGAGAAATAGATAATACAATATTAC 
ATTACAGCCT GT AT T C ATTT GT TCT CT AGAAGT T TT GT CAGAAT TTTG ACT T GT TGAC AT AAAT T T GT AAT 
GC AT AT AT AC AATTTGAAGC ACT C CTTT TCTT CAGTTCCTCAGC TCCTCT CATTTC AGC AAAT AT C CATT T 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGTATTAGGTGTTTTTCTTAGTGGAATATTAGAAATGATCATATTCATTATGAAAGGTCAAGCAAAGACA 
GCAG AATACC AAT CAC TTCATC AT T T AGGAAGT ATG GGAACT AAGTTAAGGAAGTC C AGAAAGAAGCC AAG 
AT AT AT CCTT AT T T T C ATTT CCAAAC AACT ACT AT GAT AAAT GT GAAG AAGAT T CT GTTTTT T T GT GACCT 
• AT AAT AATTAT ACAAACT T CAT GC AATGT ACT TGTT CT AAGC AAATTAAAGC AAAT ATTT AT TTAACATTG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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MMYRPDWRARKRVCWE PWVI GLVI FT S L I VLAVC I GLT VHYVRYNQKKT YNYYS T LS FT TDKL Y 
AE FGREASNNFTEMS QRLE SMVKNAFYKS PLREE FVKS QVIKFS QQKHGVLMMLL I CRFHS TED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RR 1 1 VHE KYKH P S HD YD I S LAE L S S PVP Y TNAVHRVC L P DAS Y E FQ P GDVM FVT G FGALKNDG YS 
QNHLRQAQVTL I DAT TCNE PQAYNDAI T PRMLCAGSLEGKTDACQGDS GGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 1Q7 

AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 

ATAAT G T TCT CACT TTAACT GC AGAAACGAGGG T AGAGGAAGCAGTCAT T T TGACT TACT T TCC T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 

GT GTAGAACT GGCT TGT GGCGT T T GGACATATGAACAGGAACT TAT GGT T CCAGTACAAT GGTCA 
GATATGGTCACTTTGAAAGCCAGGATGACAAATTATGGATTACCTAGATATCGGTGGCTTACTCA 
h% TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
U AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
M CAGGCCCACCAGGAAGATC T CAGT GACCT T TATCAAGAGGGT T GTGGGAAGAAAATGTATT CCT T 

M* T TT GAGAGGAACCAAACAACTGCAG GTGC T GAGGT T T CT GGGAATC TCCAT TGGGGT GACACAAA 

Ul TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ill ACAGACC AAATGAT GT CC T TGAAGAATGAC AACTC TCAGCACCT GTCATGTCCCTCAGTAGAAC T 

f I GTT GAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCT T TAATACACACT 

:-f T TGAGAT GGAGGAG TT ATAAAAAGAAATGTCACAGAAGAAAACCACAAAC T TGT T TTAT TGGACT 

III TGT GAAT T TT TGAGTACATACT AT G T GT T T CAGAAAT AT G T AGAAATAAAAATGT TGCCATAAAA 

« TAACACCTAAGCATATACTAT T C TAT GCT T TAAAATGAGGATGGAAAAGT T TCAT GTCATAAGTC 

n ACCAC CT GGACAATAAT TGATGCCC T TAAAATGCTGAAGAC AGATGTCATACCCACTGTGTAGCC 

m TGT GTAT GACT T T TAC TGAACACAG TTATGTT TT GAGGCAGCAT GGT T TGAT TAGCAT T T CCGCA 

j£| T CCAT GCAAACGAG T CACATAT GGTGGGACTGGAGCCAT AGTAAAGGT TGATT T ACT T CT ACCAA 

M C TAGTATATAAAGTAC TAATTAAATGCTAACATAGGAAGTTAGAAAATACTAATAACTT TTATTA 

P.j CTCAGCGATCTATTGTTCTGAT GCTAAATAAAT TATATATCAGAAAACTTTCAATAT TGGTGACT 

ft ACCTAAATGTGAT TT TTGCTGGTTACTAAAATATTCT TACCACTTAAAAGAGCAAGCTAACACAT 

TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
1 H T TCAGT TCT GATAAT GT TAAGAATAACCAT TAT GAAAAGGAAAAT TT GTCCTGT AT AGCAT CATT 

ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TAT T T AAAT AC T T AACC AC T AAT T T T GAAAAT T ACC AGT GT GAT ACAT AG GAAT CAT TAT T C 
AGAATGTAGTCTGGTC TT TAGGAAGTATTAATAAGAAAATTTGCACATAACTTAG TTGATTCAGA 
AAGGAC T T GTATGCTGTT T T TC T CCCAAATGAAGACT CTT T TT GACACTAAACACT T TT TAAAAA 
GCTTAT CT T TGCC TT CTCCAAACAAGAAGCAATAG T C T CCAAGTCAATATAAAT TCTACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATT 

C T TT GTT T TAT T T CAC TGAT TAATATACTGT GGCAAAT T ACACAGAT TAT TAAAT T TT T T TACAA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 

T TCTCAGAATATGGAAAGAAAAT TAAAATGTGTCAATAAATATT TTCTAGAGAGTAA 
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FIGURE 108 



MAREDSVKCLRCLLYALNLLFWLMS I SVLAVSAWMRDYLNNVLTLTAETRVEEAVI LTYFPWHP 
VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWT YEQELMVPVQWS DMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCP SVELLKPS LSRI FEHT SMANS FNTHFEMEEL 



Lh Signal peptide: 

P amino acids 1-33 

ft- 

ill Transmembrane domains: 

[fi amino acids 12-35, 57-86, 94-114, 226-248 

ui 
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FIGURE 1QQ 

CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT CT T TC TC GT GAGAGC CT AGAG GCCTT AAAAAAAAAAGT GCT T G AAAGAGAAG GGGAC AAAGGAACA 
CCAGT AT T AAGAGGATT TT CCAGT GT T TC T G GC AGTT GGTCCAGAAGGAJTGCCT CCATT CCT GC TT CT C AC CT G 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAACACT GACCACCAGTT GGAT GAGTCTCAAGGTCCTCCT CTAT GT GACAACC ATGT GAAT GGGGAG 
TGGTACC ACTTCACGGGC AT GGCGGGAGATGCCAT GCCT ACCTTCTGCATACCAGAAAACCACT GT GGAACCC A 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
M TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
U CT GACC AAGC CCAGC GT CT GCT T CCAC GT CT ACT GT GGT C ATT T TT AT GACATCT GC GAC GAG GACT GCCAT GG 

Cf CAGCT GC T CAGAT ACC AGC GAGT GCAC AT GC GCTC C AGGAACT GT GCT AGG C CCT G ACAG GC AGAC AT GCTTTG 

AT GAAAAT GAAT GT GAGC AAAAC AAC GGT GGCT GC AGT GAGAT CT GT GT GAACC TC AAAAACT C CT AC C GC T GT 
GAGTGTGGGGTTGGCC GTGT GCTAAGAAGTGATGGCAAGACTT GTGAAGACGTT GAAGGATGCCACAATAACAA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 
CT GGT T GGT GGCCT GGAGCT CT TCC T GACC AACAC CT CC T GCC GAGGAGTGT CCAAC GGC ACC C AT GT C AACAT 
p*t C CT CT TCT CT CTC AAGAC AT GT GGT AC AGT G GT CGAT GT GGT GAAT GAC AAGAT T GT GGCC AGC AACC TC GT GA 

m CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
Pf ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
jffj CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
Q GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 

TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 

CCGGG AT GGCTGTGTTT CAGAT GACTC GGT AAAGC AGTACACAT CCC GGGAT CACCT AGCAAAGCACT TCC AGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
C GGT C T ACAGGGCC AGACGC TAAC AGGC GGCCC GAT CCGCAT C GAC T GGGAGGAC TAGT TC GT AGC C AT AC CT C 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CACAAAATCAGAAGCTGGGTATAATATTTCAAGTTACAAACCCTAGAAAAATTAAACAGTTACTGAAATTATGA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
T TAT AGG GAAT T T GGAAGT GT AT CAAT AAAAC AGTAT AT AAT T T T 
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MPPFLLLTCLFITGTSVSPVMiDPCSAYISIiNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
T FC I PENHCGT HAP VWLNGSHPLEGDGI VQRQACAS FNGNCCLWNTTVEVKACPGG YY VYRLTKPSVC FHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNT SCRGVSNGTHVNILFSLKTCGT WDWNDKIVASNLVTGLPKQT PGSSGDFI IRTS KLLI PVT 
CEFPRLYTIS E G Y V PNL RN S PLE IMS RNHGIFPFTLEI FKDN E F E E P Y RE AL PT LKLRD SLY FG I E PWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 



Important features of the protein: 
Signal peptide: 

||| amino acids 1-16 



pi N-glycosylation sites. 

» amino acids 89-93, 116-120, 259-263, 291-295, 299-303 



Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 



Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 



ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE 111 

GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGC CAGTGCT GACCAGG GACTTCT GACCT GCTGGCCAGCC AGGACCT GTGTGGGGAGGC CCT 
CCTGCT GCCTT GGGGTGACAAT CT C AGCT CCAGGCT ACAGGGAGACCGGGAGGAT C AC AGAGCC AGC ATGT 
T ACAGG ATCCTGAC AGTG AT CAACCTCT GAAC AGCCTCGAT GTCAAACCCCT GCGC AAACCCCGT AT C CCC 
AT GGAG AC CT TC AGAAAGGT GGGGATCCCC AT CAT C AT AGCACT ACTG AG CC TGGCGAGT AT CATC AT TGT 
GGT T GTCCT CAT CAAGGTGATT CT GG AT AAAT ACT ACTTCCT CT GCGGGC AGCCTCTCC ACT TCATCCCG A 
GGAAGC AGCT GT GT GACGGAGAGCT GGACTGT CCCT TGGGGGAGGACG AGGAGC AC TGTGT C AAGAGCTT C 
CCCGAAGGGCCT GCAGTGGC AGTCCGCCT CTC CAAGGACCGATCC ACACTGCAGGTGC TGGACT CGGCCAC 
III AGGGAACTGGTT CT CTGCCT GT TT CGAC AACT TCAC AG AAGC TC TCGCTGAGAC AGCCTGT AGGC AGATGG 

GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
T C AG C ATCCAGT AC GAC AAACAGC ACGTCT GT GGAGGG AGCATCCTGG ACCCCC ACT G GGTCCT C ACGGC A 
fk GCCC AC TGCT T C AGGAAACATACCGATGTGTT CAACTGGAAGGT GCGGGC AGGCTC AG AC AAACT GGGCAG 

i CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
g| TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
111 GATGAGGAGC TCACT C C AG CCACCCC ACTCTGGAT C ATTGGATGGGGCTTT ACGAAGC AGAATGGAGGGAA 

P GATGTCTGACATACTGCTGCAGGCGTCAGTCCAGGTCATTGACAGCACACGGTGCAATGCAGACGATGCGT 
[II ACCAGGGGGAAGTCACCG AGAAGATG AT GTG T GCAGGC AT CC CGGAAGGGGGTGTGGACACC T GCC AG GGT 

GACAGT GGT GGGCCCC TGAT GT ACCAATCT G ACCAGTGGC AT GT GGTGGGCATC GTT AGCTGGGGCT ATGG 
CT GCGGGGGCCCGAGC ACCCCAGGAGTATACACC AAGGTCTC AG CCTATCT CAACTGG ATCT ACAATGTCT 
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 

GGGGATCCCCCAAAGT CAGACACAGAGC AAGAGTCC CCTTGGGT AC ACCCCTCT GC CC ACAGCCT C AGC AT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GC AGCC CT AGCT CGGCCACACT TGGTGCTCCC AGCATCCC AGGGAG AG AC ACAGCC C ACT GAACAAGGTCT 
CAGGGGT AT T GCTAAGCCAAGAAGGAACTT T CC CAC ACT ACT GAATGG AAGC AGGCTGTCTT GTAAAAGCC 
CAGATC ACTGTG GGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGT CCGTCTTCACCCATCCC C AA 
GCCT ACT AGAGC AAGAAAC CAGTT GT AAT AT AAAAT GC ACTGCCCT ACTGT T GGT ATGACT ACC GT T ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 

AAAA 
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FIGURE 112 



MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVS IQYDKQHVCGGS ILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSIAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 113 



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGT CTGC T TAGACCACCTGG T T TAT G T GACAGGACT TGCAT T C T CCT GGAACAT GAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 

CCGATGGCGCAT GACACAC T CGGGAC TCACCTC TGGGGCCATC AGACAGCCG T TT CCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTC TGAGAGGCCCT CC TAT GT CCC TACTAAAGCCACCAGCAAGACATAGC T GACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAAT GCCT CCAGTAAGCACAGGC T GCAAAAT CCCCAGGCAAAGGACT GTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 

CAAAT GAT C TCCAAGGGCCCT TATAC CCCAGGAGACT T T GATT TGAAT T TGAAACCCCAAAT CCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGC TACT CGGGAGGCTGAGACAGGAGAAT TACT TGAACCTGGGAGG T GAAGGAGGCTGAGACA 
GGAGAATCACTTCAGCCTGAGCAACACAGCGAGAC TC TGTC TC AGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 



FIGURE HA 

MLWWLVLLLLPTLKS VFC S LVT S LYLPNTEDL SLWLWPKPDLHS GTRTEVS THT VP S KPGTAS PC 
WPLAGAVPS PT VS RLEAL TRAVQVAE PLG SCG FQGGPCP GRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE 115 

CAGClkGTGGTCICTCAGTCCTCTC/W 

AGAATCCTCCAGAGAATTGTGAAGACTGTCACATTCTAAATGCAGAAGCTTTTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GT T T TGGGGGAGCAAGCACTTCTGGC CGGAGGTACCCAAAAAAGCC TATGACAT GGAGCACAC T T 
TCTACAGCAATGGAGAGAAGAAGAAGATT TACATGGAAAT TGATCCTGTGACCAGAACTGAAATA 
TTCAGAAGCGGAAATGGCACTGAT GAAACAT TGGAAGTGCACGACTT TAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C TGAACCAGAAGAGGAAATAGATGAGAAT GAAGAAATTACCACAAC T T TC T T T GAACAGT CAGTG 
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GAT T T GT GATAACGTGACCATGTAT T GGATCAATCCCACTCTAATAT CAGT TTC TGAGTT ACAAG 
ACT T T GAGGAGGAGGGAGAAGATCT T CAC T TT CCTGCCAACGAAAAAAAAGGGAT TGAACAAAAT 
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAACT T CCAATAAATGACTATACTGAAAATGGAATAGAATT T GATCCCATGCT GGAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
TTATCACC^AAAAAAAAAAAAAAAAAA 
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FIGURE 116 



iff 



MAKNPPENCEDCHI LNAEAFKSKKI GKSLK I CGLVFGI LALTL I VL FWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWARMLGRV 



Important features of the protein: 
lil Signal peptide: 

amino acids 1-40 



Transmembrane domain: 

amino acids 25-47 {type II) 

N-glycosylation sites. 

ti 

ipt amino acids 94-97, 180-153 

0 

III Glycosaminoglycan attachment sites. 

0 amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 

m 242 



N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 



Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



t a 
1. 1 5 



Fll 



GAGCTCCCCT CAGGAGCGCGT T AGCT T C AC ACCT T C GGCAGC AGGAGGGCGGCAGC TT CTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGT TCCAGT ACGAAGGGCT CT GGAGGAGC TGCGTG AGGC AGAGTT C AGGCT TC ACCGAATGC AGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TC TGTGTTTGCC AACATGCT GGTGAC TAACTT CTGG ATGTCC ACAGCT AACAT GT ACACCGGCAT GGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAAT TGGGGGTGTGAT GATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACT AC AAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
TGGGT CC AAC ACCAAAAACAAGAAGATAT ACGAT GG AGGTGCCCGC AC AG AGGACG AGGT AC AATCT T ATC 
CTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 



f|| CCCAAAAAAC AAGGAG ATCCCAT CT AGATT T C TT CTTGCTTT TGACTCACAGCT GGAAGTTAGAAAAGCCT 

g CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 

O GCTGAGTT AT T TAT GAATT AGAGGCT AT AGCT CACATT TTCAAT CCTCTAT TTCTTTTTTTAAAT AT AACT 

III TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 

M TT CCTCCT AGTC AAT AAACCCATT GATG ATCT AT TTCCCAGC TT ATCCCC AAGAAAACTT TT GAAAGG AAA 



GAGT AGACCC AAAGAT GTTATT TT CTGCTGT T T GAATTTTGT CT CCCCACCCCC AACTTGGC TAGT AATAA 
AC AC TT ACTGAAGAAG AAGCAAT AAGAGAAAGAT ATTTGT AATC T CTCCAGC CC ATGATCTC GGTTTTCTT 
ACACTGTGAT CTTAAAAGTTACCAAACCAAAGTCATTTTCAGTT TGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTC AGAAAT TGTCCCTAGATG AAT GAGAAAATT ATTT T T TT TAAT T T AAGT CCTAAATATAGTTAA 
AAT AAAT AAT GT TTT AGT AAAATGAT AC ACTATCT C TGTG AAAT AGCCTCACCCCT AC ATGT GGAT AG AAG 
GAAATG AAAAAATAAT TGCTT T GACATTGTCT AT AT GGTACT T T GT AAAGTC AT GC TTAAGT ACAAATTCC 
AT GAAAAGCT CACACCTGTAAT CCT AGC ACT T TGGG AGGCTGAGGAGG AAGGAT C AC TTGAGCCC AGAAGT 
TCGAGACTAGCC TGGGC AACAT GGAGAAGCCC TGT CTCTACAAAAT AC AGAGAGAAAAAATC AGCCAGTCA 
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 
TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 



AppJD=10063538 



Page 269 of 320 



FIGURE 118 

MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPi^LQAVRAmiVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CA I AGVS V FANMLVTN F WMS TANM Y T GMGGMV Q T VQ T R Y T FGAAL FVGWVAG GLT L I GG VMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide : 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGrTCCGGAGTCC 

AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 

GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 

GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 

CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 

GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 

ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 

CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 

ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 

CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 

CAGATACTCGATACCTTCCCATCGCACAACCQAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 

TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 

ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTT TGATTTACTGTTCT TAACTGCCT 

AAT CTTAATTACAGGAACTGTGCATCAGCTATTTATGATT CTATAAGCTATTTCAGCAGAATGAGATA 

TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 

CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 

ACGACATAGCAT TATGT ACAT AGATGAGTGT AACAT TTAT ATCTCACATAGAGACATGCTTATATGGT 

TTTATTTAAAATGAAAT GCCAGTCCAT TACACTGAATAAATAGAACTCAACTATTGCTT TTCAGGGAA 

ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 

TTTATAAT GAAGATTAAAATGAAGGCT TTAATCAGCATTGTAAAGGAAAT TGAAT GGCT TTCTGATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 

CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 

TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 

AAAAT ATT TTTGTTTTT GTATTTGAAGAAGAAT GAT GCAT TTTGACAAGAAATCATATATGTATGGAT 

AT ATT TTAATAAGTATT T GAGTACAGACTTT GAGGT TTCATCAATATAAATAAAAGAGCAGAAAAATA 

TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 

GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAA?VAATAAATTTCCTTCTTGTA 

CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 

ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 

TATACATT TATATTAAT AAAT TGTACATTTT TCTAATT 
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FIGURE 120 



MATHALE I AGL FLGGVGMVGTVAVTVMPQWRVS AFI ENN I WFEN FWEGLWMNCVRQANTI KMQCK 
IYDSLLALSPDLQAARGMCAASVMSFIAFM^^ I TG 

MWL I PVS WVANAI I RD FYNS I VNVAQKRELGEAL YLGWT TALVL I VGGAL FCCVFCCNE KS S S Y 
RYS I PSHRTTQKS YHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT T CAAAGGAGAAAAGGGGGAAT GT CTGAGGGAAAGCT T T GAGGAGTCC TGGACACC CAACTAC 
AAGCAGT GTTCATGGAGT T CAT TGAATTATGGCATAGAT C TT GGGAAAAT TGCGGAG T GT ACAT T 
TACAAAGAT GCGT TGAAATAGT GC TCTAAGAGT TT TGT T CAGT GGCT CAC T TCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
ATT GAAGCTATAATT TAT T TGGACCAAGGAAGC CC TGAAAT GAAT TCAACAAT TAATAT T CATCG 
CAC T T CT T CTGTGGAAGGACT T TGT GAAGGAAT TGGT GC TGGAT TAGTGGATGT T GC TATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
AT TAT T GAAGAACTACCA&A ATAAA T GCT TTAATT T T CAT T TGCTACC TC T TT T T TTATT ATGCC 
T TGGAAT GGT TCAC TTAAAT GACAT T TTAAAT AAGT T TAT GTATACAT CTGAAT GAAAAGCAAAG 
C TAAATATGT T TACAGACCAAAG TG T GAT T TCACACT GT T T TTAAAT C TAGCAT TAT T CAT T T T G 
C TT CAAT CAAAAGT GG T T TCAATAT x TT T T T TAGT T GGT TAGAATAC T T TCT T CATAGTCACAT T 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCTT TGTACAAT TTGTAAAT GT TAAGAATTTTTTTTATATCTGT 
T AAAT AAAAAT TAT T T CC AAC A 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASE I PKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GCCT GGAAGAATACATCAT GT T T T TCGATAAGAAGAAATT GTAGGATCCAGT TT T T T T T TTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGGATGGGTTTCAATG 
TAAT T AGGCTAC TGAGCGGATCAGCTGTAGCAC TGG T TATAGCCCC CACTGTC T TACT GACAAT G 
« , CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
M ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC 
O CT T GACCATAACCATAT CAGCAAT AT TGACGAAAAT GCT T TTAATGGAATACGCAGACTCAAAGA 

[|| GCT GAT T C T TAGT T CCAATAGAAT C T CCTAT TT TC TT AACAAT ACC T TCAGACCTGT GACAAAT T 

Zl TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
J|f CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
M i AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
III TCT T TGC T GGCATGATCAGACT CAAAGAACT TCACC T GGAGCACAATCAAT TT T CCAAGCTCAAC 

||| CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
I" CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCC AACAAGCTCACATT TATTGG T CAAGAGAT T T TGGATT CT T GGATATCCC T CAATGACAT CAG 
111 TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
g| TTAAAGGTCTAAGGGAGAATACAAT TATCTGTGCCAGTCCCAAAGAGC TGCAAGGAGTAAATGTG 

p| ATCGAT GCAGT GAAGAAC TACAGCAT CTGTGGCAAAAGT ACTACAGAGAGGT T T GATC TGGCCAG 

Pi GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
frf TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
111 TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAAT GACTCCCAGCACCCAGGAAT T TTATGTA 
GAT TATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCT GCACCTA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCAT TGTGATAAAAAGAGCTCT TAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CATT T T CC TCTCATACATAAT CAACCCAT TGAAAT T TAAATAC CACAATCAATG T GAAGC T T 
GAACTCCGGTT TAATATAATACCTAT TGTATAAGACCCTT TACTGAT TCCATTAATGT CGCATTT 
GT T T T AAGATAAAAC T T C T T TCAT AGG TAAAAAAAAAAA 



ft 
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FIGURE 124 

MGFNVI RLLS GSAVALVI APTVLLTMLS SAERGCPKGCRCE GKMVYCE SQKLQE I PS S I SAGCLG 
L S LRYNS LQKLKYNQFKGLNQLT WLYLDHNH I SN I DENAFNG I RRLKEL I L S S NRI S YFLNNT FR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTI ICASPKELQGVNVIDAVKNYS ICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and c(24P-dependent protein kinase phosphorylation site. 

amino acids 4 65-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 



App_ID=10063538 



Page 276 



FIGURE 125 



CCGTTAT CGTCT TGCGCTACIGCTGAATGTCCGICCCGGAGGAGGAGGAGAGGCT TT I'GCCGCTG 
ACCCAGAGATGGCCCCGA.GCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATTGAAGAGGAAGGCTTTCTAAAGCT TTGGCAAGGAGTGACACCCGCCAT TTACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
^ ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
f;J| T TAGCCAAT CCAAC T GAC CT AGTGAAGGT TCAGAT GCAAATGGAAGGAAAAAGGAAACTGGAAGG 

AAAACCAT TGCGAT TTCGTGGT GT ACATCATGCAT T T GCAAAAAT CT TAGCTGAAGGAGGAATAC 
GAGGGCT T TGGGCAGGCTGGGT AC CCAATATACAAAGAGCAGCACTGG TGAATATGGGAGAT T TA 
ACCACT T ATGATACAGTGAAACACTACT TGGTAT T GAAT ACAC CAC T TGAGGACAATATCAT GAC 
TCACGGT T T ATCAAGT T TATG T T C T GGAC T GGTAGCT TCTAT T C TGGGAACACCAGCCGAT GTCA 



t - 3 

w 
fff 

III 

||| TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCG 
» ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 

a 



li 

rw 
p 



ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVIGQFLANPTDLVKVQMQMEGKRKLEGKPIA 

NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 



13 amino acids 25-38, 130-147, 233-248 

fit 
it! 

W 
III • 

5::? I 

in 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACATCC TC TCCAT GTTC TCAGCTAT CC G T TCTCAGCACAGCGGT G TAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGT TCAATGTGAACGT GCT GGCCCT CAGCATC T GCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCAC ATCAT TAACAT CAATAGCATGTC TGGCCACCGAGTGT TA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CAT CCAGAT TGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACC TAGT GACT GT GGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRLALVT GASGG I GAAVARAL VQQGLKWGCARTVGN I EELAAE CKS AGYPGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 

i^CTTCTAC ATG GGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAAGTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T TAAT T T T T AAC T GATAG T T G T AC AT AT T T GGG GG TACAT GT GAT AT T T GGAT ACAT G TAT AC AA 
TATAT AATGATCAAATCAGGGT AACT GGGATAT CCAT CACATCAAACATT TAT TT TT T ATT C TT T 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
C TCT GCC TGCCAGG T TCAAGCGAT TC TCATGCC TCCACC TCCCAAG TAGC TGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAAC TGTATAT T TGTACCAG T TAACCAACCGTAC TT CATC CCCAC TCCTCT CTAT CCT TCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATAT TTGTCTTTCTGT GCC TGGCTT AT TT CAC T TAACATAAT GACT TCCTG 
TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG 
GCAAAAA 
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FIGURE 13Q 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGSSPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 

TTCT GAAGTAACGGAAGCT ACCTT GT AT AAAGAC CTC AAC AC TGCTGACCATGATC AGCGCAGCCT GG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
T CCAT CTGT GT GTCGCTGCGAT GC GGGTTTCAT T TACT GT AATGATCGCTTTCT GAC ATCCATTCC AAC AG 
GAAT ACCAGAGGAT GCTACAAC TCT C T ACCTT CAGAACAACCAAAT AAATAAT GCTGGGATT CCT T C AGAT 
TT GAAAAACT T GCTGAAAGT AGAAAG AATATACC T ATACC AC AACAGTTT AGAT GAATTT CC TACC AACCT 
CCCAAAGT AT GT AAAAGAGT T ACATTTGCAAGAAAATAAC AT AAGGACTAT CAC TT ATGATT CACTTTCAA 
AAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GCCC AGGACTAT AGAAGAACT ACGCTTGGAT GAT AATCGCAT AT C C ACTATTT CAT CACCAT CTCTTC AAG 
GTCT CACT AGTCT AAAACGCCT GGTTCT AG ATGGAAACCT GT TGAAC AATCATGGTTT AGGT GAC AAAGTT 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
GAT GATTTGGAC AAT ATAAC AC AACTGATT CT T C GC AACAAT CCCTGGT AT T GC GGGTGC AAGAT G AAATG 
GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
AGGT TCGTGGGAT GGC TATT AAGGATCT CAAT GC AG AACTGT T T GATTGT AAGGAC AGTGGGAT TGT AAGC 
ACCATT CAGAT AACC ACT GC AATACCC AAC AC AGTG TATCCT GC CC AAGG AC AGTG GCCAGC T C CAGT GAC 
CAAAC AGCCAGAT ATT AAGAACCC CAAGCTCACT AAGG ATCAACAAACC ACAGGGAGTCCCT CAAGAAAAA 
CAAT T AC AAT TACT GTGAAGT C TGTC ACCTCT GAT ACC AT TC AT AT CTCT TGGAAACTTGCT CT ACCT ATG 
ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
AACAGGGG AACGCAGT GAGT AC TT GGTC AC AGCCCTGG AGCCTGAT TCACCCT AT AAAGTAT GC AT GGTTC 
CCAT GG AAAC CAGC AACCTCT ACC TATTTG ATGAAACTCCTGT T T GT ATT GAGACTGAAACT GC AC CCCTT 
CGAAT GTACAAC CCT AC AACCACC CTCAAT CGAGAGCAAG AGAAAG AACCTTACAAAAACCCCAATTT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT CAT AGG AATGGATCGCTC T T CT CAAGGAACTGTGC AT ATAGC AAAGGG AGGAGAAG AAAGGATG ACT AT 
GC AG AAGCTGGC ACTAAGAAGGAC AACT CT ATCC TGGAAATC AGGG AAAC TT CT TTTC AG AT GT TACC AAT 
AAGCAATG AACC CATCTCGAAGGAGG AGTTTGTAAT ACAC AC CAT ATTTCCT CCT AATGGAATGAAT CTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
T C AC ACTC ATGAT GCT G AAGGACT C AC AGCAGACTTGTGT TT TGGGTTTT T T AAACCT AAGGGAGGT G ATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAim^ 

IKNPKLTKDQQTTGS PSRKT I T I TVKSVTSDT IHI S WKLALPMTALRLSWLKLGHSPAFGS ITET 
I VT GE RS E Y L VT AL E P DS P YKVCMVPME T S NL YL FDE T P VC I E TE TAP LRM YNP T T T LNRE QE KE 
PYKNPNLPIAAIIGGAVALVTIALLALTC 

ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
1 f GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
fj| TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CI CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 



m 

t z, 

M 

hi 

m 

II? 

P! 



GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACT CC AACAGAGCCAGGACT AT ATCAACC TC TT CT GC GC CAACATGATGGACTT GAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 



| ; || AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGAT GAAGAAT TATC TAAAGCTATTCAA 

13 TATCAGC AGCAT TTT TCGAGGAGAGT GAAGAGGCGAGAAAAACAAT T T CCAGAT TCTCGCTCTGT 

IV TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 



II? 

p 

III 

a 



MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 



pi Important features of the protein: 

Signal peptide: 

amino acids 1-26 



01 Signal peptide 



Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 135 

GGT CT GAGTGCAGAGCTGCTGTCATGGCGGCCGCTCT Gl 1 GGGGCT TCT TTCCCGTCCT GCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCT TACAGAT TTGATCCCGTT CGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
C TGACAT GAGACGGGAAATGGAGCAGTCAATGAATATGCTGAATTCCAACCATGAGT TGCCTGAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGG TAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGT GGCATCCAAGTCT TGGAAAACCGT G TGAAGCAACTACTATAAACT T 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGTCAATAT TGATGTCACTGAATTAATTACAGTGT CCTATAGAAAATGCCATTAATAAATTAT 
ATGAACTACTATACAT TATGTATATTAATTAAAACAT CT TAAT CCAGAAATCAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 126 



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGE EHVGFLKTDGS FWHD I PSGSYWEWS PAYRFDPVRVD I T SKGKMRARYVNY I KT S E 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFI^P*^^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of" the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cOlP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



ill 

in 
ill 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGC TACGT TGGCTTT CTGGAAGGGGAGGCTATATGCGTCAAT T CCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCT GT GTGATCGCCACAAACCTTCAGGAAATACGAAATGGAT TTTC TGAGATACGGGGCAGTG 



5 ! 

f*\ TGCAAGCCAAAGATGGAAACATTGACATCAGAATCT TAAGGAGGACTGAGTCTT TGCAAGACACA 



AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAAC T AC CAGAC CC C T GACCAT TAT AC T C T CCG GAAGAT CAGC AGC C T CG CC AAT T CC T T T C T T A 
CCATCAAGAAGGAC C TCCGGC TCTCT CATGCCCACAT GACATGCCAT TGT GGGGAGGAAGCAAT G 



rtl AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 
H TTTGGGGGAACTAGACAT TCT T CT GCAATGGAT GGAGGAGACAGA ATAGG AGGAAAGT GATGC T G 

P C TGCTAAGAATATT CGAGGTCAAGAGC TCCAGTCT TCAATACC TGCAGAGGAGGCATGACCCCAA 

M.s ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 

pi 

CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
fn TGT AATATC TT T CT GC TATT GGAT AT AT T TATTAGT T AATATAT T TAT T TAT TT T T TGCTAT T TA 

| ; || ATGTATT TATT TTTTTACTTGGACATGAAACTTTAAAAAAATTCACAGAT TATATTTATAACCTG 

ACT AGAGCAGGTGATGTAT T T T TATACAG TAAAAAAAAAAAACCT TGTAAAT TCTAGAAGAGTGG 
C TAGGGGGGTT AT TCAT T TGTAT T CAACTAAGGACAT AT T T AC TCAT GCTGATGCT CT G T GAGAT 
ATTTGAAAT TGAACCAATGACTACT TAGGATGGGTTGTGGAATAAGT T TTGATGTGGAATTGCAC 
ATCTACCT T ACAAT TACTGACCAT CCCCAGTAGACT CCCCAGT CCCATAAT TGT G TAT CT TCCAG 
CCAGGAAT C CTACACGGCCAGCATG TATT TCTACAAATAAAG TTTTCTTT GCAT ACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 

SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 

SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAWKALGELDILLQWM 

E 



PI 



111 

fi 



Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites . 



p amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
T TT GT GGCTGAAGCAAAGCCAC CT ACAGT TACTAT GC C T CGAAT CAAGGCAT TGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTCTTTTATGGAGATGAAACCTGGGTTAAATTA 
TTCCCAAAGCATT T TG TGGAAT AT GATGGAACAACC T CAT TT T T CGT G TCAGAT TACACAGAGGT 
H GGATAAT AATGTCACGAGGC AT T T GGATAAAGTAT TAAAAAGAGGAGAT TGGGACATAT T AATCC 

CI TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
p AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 
{•ft GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
ACT TGGCT T ACCGATTCCAAAAGACAG T G TAGGGAGC CT CCTAT TCCCAGT TGT GGAAGGAAGAC 



m 



||I CAATGAGAGAGCAGTT GAGAT T T T TACAT T TGAATACAGTGCAGCT TAGTAAACTGT T GCAAGAG 

fit AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
[ GAACTGGATCAGAC TGTACTTGGAGGAAAAGCATTCAGAAGTCCTAT TCAACCT GGGCTCCAAGG 

^ TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
P TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
HI CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
P CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
h; GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
yf AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 

TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
CAAGGTGAT T GTAAAGAGCTGGCGG T CACAGAGGAACAAGC CCCCCAGCT GAGGGGG T G T GTGAA 
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACT CAGAT CCACAGAGCCCAGGATCAAGGGACCCACT GCAGTGGCAGCAGGACT G T TGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C TGCACACAGTATGTAGT TACCAAAAGAATAAACGGCAATAAT TGAGAAAAAAAA 
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FIGURE lao 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLSDSVIRQAKZ^GKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASS TEEVNTPL ILI S SAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58.-7 6 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites ♦ 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
T CCCTATAGAAAACAACTGCCAGCACC T TAAGACCAC TCACACCT TCAGAGTGAAGAACTTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTT GCAT TAGCC TCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GT T GGGGTGACAGATAAAT T TGAGAACAGGAAACACAT TGAAT T T TCATT TCAACCAGT TT GCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT 
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FIGURE 142 

MLLLLLEYNFPIEWCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 



CTAGAGAGT ATAGGGCAGAAGGATGGCAGATGAGT GAC T CCACATCCAGAGCTGCC TCCCT T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGT TT GCTGT CCACACAT TCAACCAACAGAGCAAGGACTAC TAT GCCT ACAGACTGGGG 
^~ CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
K GGGGAGAAC TAGGT GTGGGAAAT T TGAAGACGACAT T GACAACTGCCAT T TCCAAGAAAGCACAG 

|f| AGCT GAACAATACT T TCACCTGCT TC T TCACCATCAGCACCAGGCCC TGGATGACTCAG T TCAGC 

III CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCGATGT 
pi GCT GC TCCCACATT CC GTGGACAT CAGCACTAC TCTCCTGAGGAC TC T TCAG TGGCT GAGCAGC T 

||f TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATCT T GAGCC TAATCATGTAG T GTAGATCATTAAACATC AGCAT T T TAAGAAAAAAAAAAA 
Pi AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

U! 

\ !■# 

Wi 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 



C TGTGCAGC'l'CGAGGC TCCAGAGGCACAC T CCAGAGAGAGC CAAGG T TCTGACGC GATGA GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
||1 AAGC T TGCCAGGAGGC TGGCAGTACAGAGCGCAGCAGCGAGCAAATCC TGGCAAGTGACCCAGCT 

CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGT CACT CCAT TC T CAGCCCCTAGCAGAGCGTCTGGCACACTAGAT TAGTAGTAAAT GCT TGAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 



U G TGTAT TCCCGT TT TACAGATAAGGAAAC TGAGGCCC AGAGAGCTGAAGTAC TGCACCCAGCATC 



ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
C TC TAATGAAAT TG T GAAAGCT CC AT GT T TAGAAATAAATGAAAACACC TGA 
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FIGURE 146 



mKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEAJT^QFPDGIHYNGC^ 

VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of" the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 



App_ED=10063538 



Page 298 



FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 
amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAATTAAGTGCAGAAAGAC CTTT GAATGAACAGATTGCTGAA 
GCAGAAGAAGACAAGAT TAAAAAAACATAT CCT CCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
H TGT TGATAACTT GAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATT GAGAAAGAAAGACAATCTA 

TAAGAAGCTCC CCACT T GATAATAAGT TGAAT GTGGAAGATGTTGATTCAACCAAGAAT CGAAAACTG 
ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA 
Ul TCAACTAGACGGGACTCCT TTAACCGC TGAAGACAT TGTCCATAAAATCGCTGCCAGGATTTATGAAG 

||! AAAATGACAGAGCCGTGTTT GACAAGATTGTTT CTAAAC TACTTAATCT CGGC CT TATC ACAGAAAGC 

111 CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAftAATTAATCTCAAAGGAAGCCAACAA 
TTAT GAGGAGGATCCCAATAAGCCCACAAGCTGGAC T GAGAATCAGGCTGGAAAAATACCAGAGAAAG 
T GACTCCAATGGCAGCAAT TCAAGATGGTCT T GCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
T TAACCT T GACAAATGGCT TGGAAAGGAGAACTAAAACCTACAGT GAAGACAACTTTGAGGAACTCCA 
ATATTTCCCAAATTTCTATGCGCTACT GAAAAGTAT TGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACT GAT TACTAT CATGAAAACACTGATTGACTTTGTGAAGATGATGGTGAAATATGGAACAATATCT 
CC AGAAGAAGG T GTT T C CT AC C T T GAAAACT TG GAT GAAAT GAT T GC T C T TCAGACCAAAAACAAGCT 
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATC^GAGAAGAGTCATGAAGAAA 
C AG ACAG T AC C AAGG AAG AAG C AG C T AAG AT G GAAAAG GAAT AT G GAAG C T T G AAG GAT T C CACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CAT CAGAAAAAATATT GAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGAT GAGAGACTT CAT CAATAAACAAGCTGATGCTTATGT GGAGAAAGGCATCCTT GACAAGGAA 
GAAGC C GAGGC C AT CAAGC GC AT T TAT AG CAGC CT G TA&&AAT GG CAAAAGAT CC AGGAGT C T TT CAA 
CTGTTTCAGAAAACATAATAT AGCTTAAAACACTTC TAAT TCT GT GATTAAAATTT TTT GACCCAAGG 
GTTAT TAGAAAGT GCT GAATT TACAGT AGTTAACCT T TTACAAGT GGTTAAAACATAGCT TTCTT CCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCT GAAAAAAAAAAAAAAAAAAAA 



m 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEASEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETL I T IMKTL I DFVKMMVKYGT I SPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE lsi 



CGGCT CGAGGCTCCCGCCAGGAGAAAGGAACATTC TGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
P CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGC TC TCCACATGAAGTC CT G T CAC TCACCAC T GT GCAGGAGAGGGAGG T GGT CATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
Ml TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
||| GAAAT GGCT CGAGC TCAGAAGATAAAAGATAAG TAGGG T AT GCT GATCCTCT TT TAAAAACCCAA 

ffj GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 
AGACCTACTTACAAAGTGGCATATATTGCAATT TATT TTAATTAAAAGATACCTATTTATATAT T 
TCT T TATAGAAAAAAG TC T GGAAGAGT T TAC T TCAAT TGTAGCAATGTCAGGGT GGTGGC AGTAT 



0! 
W 



& AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 

If! 



ri 



AT T C C T T G T ATAAAAATAA.GAAAAGAAA.T T AAT C T T GAG G T AAGCAGAGC AGAC ATCAT C T C T GA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 
||f G TGGC TGGAAT C TC TGGGTAAGGAACT TAAAGAACAAAAAT CAT CT GGT AAT TCT TT C CTAGAAG 

M GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
f!f ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 

GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
GACTGGT TTCCT TGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGATGAAGGCAGAGATCGGAGTTTT GCAGCCACAAGCTAAGAAACACCAAGGAT TGTGGCAACC 
ATCAGAAGCTTGGAAGAGGCAAAGAAGAATTCTTCCCTAGAGGCTTTAGAGGGATAACGGCTCTG 
CTGAAACCT TAATC TCAGACT T CCAGCCTCC T GAACGAAGAAAGAATAAAT T TCGGCT GT T T TAA 
GCCACCAAGGATAAT TGG T TACAGCAGCTCTAGGAAACTAATACAGC TGC TAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGC TAGCT GCCAT GC TATGAGCAGGCCTATAAAGAGACT TACGTGGTAAAAAATGAAG T C T CCT 
GCCCACAGC CACAT TAGTGAACCT AGAAGCAGAGACT C T G T GAGATAATCGATGT TTGT T GT T T T 
AAGT T GC TC AGT T T TGGT C TAAC T T GTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 1*2 

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEE I SWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPWIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAP I TDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature . 

amino acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 



C T T CAG AAC AGG T T C T CC T T C C CC AG T CAC C AG T T G C T C GAG T T AGAAT T G '1' C 1 G C A ATGG CC GC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
T TGGAGAGAGTGGAGAGAT CAAAGCAAT TGGAGAACTGGAT TT GCTGT T TAT GTCTCT GAGAAAT 
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAAT T AGAT GCCCCAAAGCGAT T T T T T TTAACCAAAAGGAAGATGGGAAGCCAAAC TC CAT CAT G 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGAC TT TCTAAGCATAGATAT TTAT TGAT AACAT T TCATT GT AACT GGTGT TC 
TAT AC AC AGAAAACAATT TAT T T TT TAAATAAT TGTC TTT T TC CATAAAAAAGAT T AC TTTCCAT 
TCCT T TAGGGGAAAAAAC CC CTAAATAGC T T CATGT T TC CATAATCAGTACT TTAT AT T T ATAAA 
TGTAT TTAT TAT TAT T AT AAGAC T GC AT T T TAT T TAT AT CAT T T TAT T AAT AT G GAT T TAT T TAT 
AGAAACAT CAT TCGATATTGCTACT T GAGT G TAAGGCTAATAT TGATAT TTATGACAATAAT TAT 
AGAGCTATAACATGT TTAT T TGACCT CAATAAACACT TGGATATCCC 
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FIGURE 1*4 

MAALQKSVSS FLMGTLATSCLLLLALLVQGGAAAP I SSHCRLDKSNFQQPYI TNRTFMLAKEASL 
ADNKTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites . 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
^ CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 

f% GCC CAAC CGCCACCCAGAG T CCTGTAGGGCCAGTGAAGATGGACCCC T CAACAGCAGGGCCATCT 

his 

p CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
U] TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
['I GCT GC TCTACCACAAC CAGACT GT C T TCTACAGGCGGCC AT GCCATGGCGAGAAGGGCACCCACA 

j£| AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
gg GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
s ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
f| CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 

CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
P GCCCAGCACAGGCACT T TCTAGATAT T TCCCCCT TGCTGGAGAAGAAAGAGCCC CT GGT T T TAT T 

f|| TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 

T T GACAT GGAT GATTCTGAGGAGGAAGCTGT TAT TGAAT GT AT AGAGAT TTATCCAAATAAATAT 
C T T TAT T TAAAAATGAAAAA 



0 
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FIGURE 1F»6 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

In^>ortant features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites . 

amino acids 44-50, 150-156 
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FIGURE 1^7 

CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT TGAGGGACC TCCGAGTAGAACC T G T TACAACTAGTGT TGCAACAGGGGACTAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GAT TT GT GT GACGGGCAAAAGCAACT T CCAGTCCTACAGCT GT GTGAGGT GCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATT TCATT GGGGCCCATAATATT CCTAATGCAAATATGAATGAAGATGG 
C C C T T CC ATG T C T G T GAAT T T C AC C T CAC C AGGC T GC C T AGAC CAC ATAAT G AAAT AT AAAAAAA 
AG T GTGT CAAGGCCGGAAGCCT G T GGGATCCGAACATCACTGC T TGTAAGAAGAAT GAGGAGACA 
G TAGAAGTGAAC T TCACAACCACTCCCCT GGGAAACAGATACATGGCTCT TATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CAT GGGT GCTGGTGGCAGGGAT C TAT C TAATGT GGAGGCAC GAAAGGATCAAGAAGAC T TCCT T T 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
T CACACAATT TG TTAC T TCAG T GAAT T TC T TGAAAAC CATT GCAGAAG TGAGGT CATC CT TGAAA 
AGT GGCAGAAAAAGAAAATAGCAGAGATGGGTCCAG T GC AGTGGCT TGCCAC TCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
G TGAT CTAAGAAGCCAGAT T CATC TGCACAAATACG TGGTGGTC TAC T T TAGAGAGAT TGATACA 
AAAGACGATTACAAT GCT CT CAG TGT C TGCCCCAAGTACCACC T CATGAAGGATGCCACTGCT T T 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 

MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSIIJyiNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKC^ 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYIM^RHERIBCKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILE KWQ KKKI AEMGPVQWL AT Q KKAADKW FLL S ND VN SVC DGT CG KS EGSPSENSQDLF PL A 
FNLFCSDLRSQIHLHKYVWYFREIDTECDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 

GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-royristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCA^CATGACAGTGAA.GACCCTGCATGGCCCAGCCATGGTCAA.GTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
A T CAT CAAT GAAAAC C AG C GC GT T T C CAT GTCACG T AACAT CGAGAGC CG C T C C ACC T CC CC C T G 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCA GTAAG AGGTGCATATCC 
ACT CAGCTGAAGAAG 
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FIGURE 160 

MTWTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVS FQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGT T TGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
C GG GAC T TAAA T AAAGGC AGAC GCTGTTTTTC T AAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHIAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSWYDCFEAALGSEVRIWSYTQPRYEECELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFKEDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWAC PMDKY I HKRWALVWLACLL FAAALS LILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLL Y S ADDS GF E RL VGALASALCQLPLRVAVDLWS RREL S AQG P VAW FHAQ RRQTLQEGGVWLL F S P 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLH PDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCT CTGTGCCAGCCCCG ATGA GGACGC TGCT GACCAT CT TGACTGTGGGATC CC TGGC T 

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 

AAACATC CT GACGTGGGACAGC GGGC CAGAGGGCACCCCAGACACGGTC TACAGCATCGAGTATA 

AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 

AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 

G T CAGT GCGGGAGGCCGG TC AGCCACCAAGATGAC TGACAGG T T CAGC T C TC TGCAGC ACAC TAC 

CC TCAAGCCACC TGATGTGACCTGTATCTCCAAAG TGAGATCGAT T CAGATGAT TGT T CATCCTA 

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

T TCTACCACTTAGAGCTCCAGGTCAACCGCACCTACCAAATGCACCT TGGAGGGAAGCAGAGAGA 

ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CAC C T T AGG CC TAAAGG T CAG C T T CAGAAAGAG CCAC CAGC T G GAAG C T GCAT GT TAG G T G GC C T 

T T CT C T GCAGGAGG T GAC C T CCT T GGCTATGGAGGAATCCCAAGAAGC AAAATCAT TGCACCAGC 

CCCTGGGGATTTGCACAGACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 

GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGG G AG T G GCAT GCAGG GCC CCT G CCAT GGG T GCG C T CC TCACC GGAACAAAGCAG CAT GAT A 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 

CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 

GCCTGGAAAAGAACCAGGCCT GGAAAAGAACCAGAAGGAGGCT GGGCAGAAC CAGAAC AACCT GC 

ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 

GAAAT GCAGGTCCACCAGGGAGGGAGACACACAAGCC T T T T C T GCAGGCAGGAGT T TCAGACCCT 

ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 

TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 

C T CACCAC TCACAAGC TG TGTGAC T T CAAACAAAT GAAATCAGTGCCCAGAACC TC GGT T TCC TC 

ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 

TCTT TAAAGTGC TTAATAGT GCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATT TAAAAA 

AAAAAAAA 
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FIGURE 16a 

MRT LLT I LTVGS LAAHAPEDPSDLLQHVKFQS SNFENI LTWDS GPEGTPDT VYS IEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGT IMI CVPTWAKESAPYMCRVKTLPDRTWTYS FSGAFLFSMGFLVAYLCYLS YRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
S PSDQGPS PWGLLESLVCPKDEAKS PAPET SDLEQPTELDS LFRGLALTVQWE S 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites, 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites . 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE l6s 

MY4f^Y-M'flf'1T4f;AAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACiyVC ^TGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGT G T TGAACACAGT TT TGGATAT T T TCCAAAAGAT T TGAT CAAGG TACT TCATAAATACACGGA 
AGAAGAGC TACATAT TCCAGCAGAT GAGACAGACT T T GT CT GC T TTGAAGGAGGAAGAGAT GAT T 
T TAAT AGT T ATAATGTAGAAGAGCT T T TAGGAT CT T T GGAACT GGAGGAC TC TGTACC TGAAGAG 
TCGAAGAAAGCT GAAGAAGTT T CT CAGCACAGAGAGAAAT C TCCTGAGGAGTCT CGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGC TGCAGGGACAGCCCT CAGCTCAGGAGAGCCACCC TCACAC CAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCT TACAAAGTC CT GAAAACAGAAAT GAGTCAGAGAGGAAGTGGACAGTGCGT TAT T 
CATTACAGCAAAGGATTTCGTTGGCATCAAAATCTAAGTTTGTTTTACAAAGATTGTTTTTAGTA 
CTAAGCTGCCT TGGCAGT T T GCATT T T TGAGCCAAACAAAAAT ATAT TAT T T TC CCTT CT AAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWL FVLGALWWPGQS DLSHGRRFSDLKVCGDEECSMLMYRGKALED FTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHS FGYFPKDLI KVLHKYTEEELHI PADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSE-DGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKT DAYKVLKTEMS QRGS GQCVIHYSKG FRWHQNL S L FYKDC F 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 



01 



N-glycosylation site. 

amino acids 294-298 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 




amino acids 30-34 



Tyrosine kinase phosphorylation site. 



amino acids 67-76 



N-rayristoylation sites* 

amino acids 205-211, 225-231, 277-283 



Amidation site. 



amino acids 28-32 
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FIGURE 167 



C CAGGAC CAGGGCGCACCGGC T CAGCCT C TCAC TT GT CAGAGGCCGGGGAAGAGAAGC AAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCG ATGA GCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTGA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
T GC CT GC CCAGATCTCTACCAGTGGT CTGATGGAAGCAAT TCC CAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
C TT GGGGGTCCCTACCT T TACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGTAT GAACCAGAGAT TAAT CCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAAC CAGT CT ACACT GT GGAT T T CAAAGAGTA 
CCAGAAAAGAAAGTGGCATGGAAGTATAATAACTCAT TGACT T GGT T C CAGAAT T T TGTAAT TCT 
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
AAGAT GAACTGTAAGC TCCCCCT TGAGGCAAATATTAAAGTAAT TT TTATATGT C TAT TAT T TCA 
T T T AAAGAATAT GC TGTGC T AATAAT GGAGTGAGACATGC T TAT TT TGC TAAAGGATGCACCCAA 
ACT T CAAACT TCAAGCAAATGAAAT GGACAATGCAGATAAAGT T GT TATCAACACGTCGGGAG TA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
T G TATAT TGTAT TGAAAT T TACAGTGTGCAAAAGTATT TTACC T T T GCATAAGTGTTT GATAAAA 
ATGAACT GT TCTAATATT TATT T T TATGGCAT C TCAT T TT T CAATACATGC TCT T T TGATT AAAG 
AAACT TAT T ACT GTT GTCAACT GAAT TCACACACACACAAATATAGTACCATAGAAAAAGT T TGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AAT AAGAAG CTAT T TCAT TAAGTGTGATATAAACC T CCT CAAACATT T TAC T TAGAGGCAAGGAT 
T GT C TAAT T TCAAT T GTGCAAGACAT G TGCC T TATAAT TAT TT TTAGC T TAAAATT AAACAGATT 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
TGAC^TACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
AT T GACACT GGAGGCAGATAG T TGCAAAG T TAGTCTAAGG T TT CCC TAGC T G TAT T TAGCC T C TG 
ACT AT AT TAGTATACAAAGAGGT CAT G TGGTTGAGAC CAGGTGAATAG TCAC TATCAGT GTGGAG 
ACAAGCACAGCACACAGACAT T T TAGGAAGGAAAGGAACTACGAAAT CGTGTGAAAAT GGGTTGG 
AACCCAT CAGTGATCGCATAT T CAT T GATGAGGGTT T GCT T GAGATAGAAAATGG T GGC T CC T TT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 

MSRWS LLLGAALLCGHGAFCRRWSGQKVC FADFKHPCYKMAYFHELS S RVS FQEARLACES E 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 



| ; j N-glycosylation sites. 

f|| amino acids 86-89, 255-258 



W : l cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 



p N-myristoylation sites. 

f|| amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 

145, 212-217 
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